107506630 




THIS IS TO CERTIFY THAT ANNEXED HERETO IS A TRUE COPY FROM 
THE RECORDS OF THE UNITED STATES PATENT AND TRADEMARK 
OFFICE OF THOSE PAPERS OF THE BELOW IDENTIFIED PATENT 
APPLICATION THAT MET THE REQUIREMENTS TO BE GRANTED A 
FILING DATE. 

APPLICATION NUMBER: 10/091,244 
FILING DATE: March 04, 2002 

RELATED PCT APPLICATION NUMBER: PCT/US03/06910 
By Authority of the

COMMISSIONER OF PATENTS AND TRADEMARKS



M. TARVER 
Certifying Officer 




PRIORITY 
DOCUMENT 

SUBMITTED OR TRANSMITTED IN 
COMPLIANCE WITH RULE 17.1(a) OR (b) 




Please type a plus sign (+) inside this box -> [+]

PTO/SB/05 (11-00)

Approved for use through 10/31/02. OMB 0651-0032

U.S. Patent and Trademark Office: U.S. DEPARTMENT OF COMMERCE

Under the Paperwork Reduction Act of 1995, no persons are required to respond to a collection of information unless it displays a valid OMB control number.



UTILITY 
PATENT APPLICATION 
TRANSMITTAL 



(Only for new nonprovisional applications under 37 CFR 1.53(b))



Attorney Docket No.: 300622004620

First Inventor: Rajesh S. Gokhale, et al.

Title: METHODS TO MEDIATE POLYKETIDE SYNTHASE
MODULE EFFECTIVENESS

Express Mail Label No.: EL 796955920 US





Express Mail Label No.: EL 796955920 US

Date of Deposit: March 4, 2002

I hereby certify that this paper or fee is being deposited with the United States Postal Service "Express Mail Post Office to Addressee" service
under 37 C.F.R. § 1.10 on the date indicated above and is addressed to the
Assistant Commissioner for Patents, Washington, D.C. 20231.






APPLICATION ELEMENTS 

See MPEP chapter 600 concerning utility patent application contents. 


ADDRESS TO: Assistant Commissioner for Patents
Box Patent Application
Washington, DC 20231






Fee Transmittal Form (e.g. PTO/SB/17)

(Submit an original, and a duplicate for fee processing)

Applicant claims small entity status.

See 37 CFR 1.27.

Specification [Total Pages 61]

(preferred arrangement set forth below)

- Descriptive title of the Invention
- Cross Reference to Related Applications
- Statement Regarding Fed sponsored R&D
- Reference to sequence listing, a table, or a
  computer program listing appendix
- Background of the Invention
- Brief Summary of the Invention
- Brief Description of the Drawings (if filed)
- Detailed Description
- Claim(s)
- Abstract of the Disclosure



8. 



CD-ROM or CD-R in duplicate, large table or Computer 
Program (Appendix) 

Nucleotide and/or Amino Acid Sequence Submission 

(if applicable, all necessary)

□ Computer Readable Form (CRF)

i. □ CD-ROM or CD-R (2 copies); or
ii. □ paper

□ Statements verifying identity of above copies



[Total Sheets 



22 



31 



6. 



m 



Drawing(s) (35 USC 113)
Oath or Declaration [Total Pages ]

a. □ Newly executed (original or copy)

b. □ Copy from a prior application (37 CFR 1.63(d))
(for continuation/divisional with Box 18 completed)

i. □ DELETION OF INVENTOR(S)
Signed statement attached deleting inventor(s) named in the
prior application, see 37 CFR 1.63(d)(2) and 1.33(b)

Application Data Sheet See 37 CFR 1.76 



ACCOMPANYING APPLICATION PARTS 



9.0 

10. □ 

11. □ 

13. □ 
14.0 

15. O 

16. □ 

17. □ 



37 CFR 3.73(b) Statement
(where there is an assignee)

Power of Attorney

Copies of IDS
Citations

Assignment Papers (cover sheet & document(s))

English Translation document (if applicable)

Information Disclosure Statement
(IDS)/PTO-1449

Preliminary Amendment

Return Receipt Postcard (MPEP 503)
(Should be specifically itemized)

Certified Copy of Priority Document(s)
(if foreign priority is claimed)

Request and Certification under 35 U.S.C. 122(b)(2)(B)(i). Applicant
must attach form PTO/SB/35 or its equivalent

Other 



18. If a CONTINUING APPLICATION, check appropriate box and supply the requisite information below and in a preliminary amendment, or in an Application Data Sheet
under 37 CFR 1.76:

□ Continuation □ Divisional ☒ Continuation-in-part (CIP) of prior application No: 09/500,747

Prior application information:

Examiner: Kathleen Kerr

Group / Art Unit: 1652



For CONTINUATION OR DIVISIONAL APPS only: The entire disclosure of the prior application, from which an oath or declaration is supplied under Box 5b, is considered a part of the
disclosure of the accompanying continuation or divisional application and is hereby incorporated by reference. The incorporation can only be relied upon when a portion has been
inadvertently omitted from the submitted application parts.



19. CORRESPONDENCE ADDRESS 



Customer Number or Bar Code Label 




or Q Correspondence address below 



Address 



Country 



(insert Customer No. or Attach barcode label here) 



Morrison & Foerster LLP
Brenda J. Wallach
3811 Valley Centre Drive, Suite 500
San Diego
USA
Zip Code 92130-2332
Fax 858-720-5125

Name (Print/Type): Brenda J. Wallach
Registration No. (Attorney/Agent): 45,193
Signature
Date: March 4, 2002



sd-81314 



PTO/SB/17 (03-00)
Approved for use through 10/31/2002. OMB 0651-0032
U.S. Patent and Trademark Office: U.S. DEPARTMENT OF COMMERCE

Under the Paperwork Reduction Act of 1995, no persons are required to respond to a collection of
information unless it displays a valid OMB control number.

FEE TRANSMITTAL
FOR FY 2002

Patent fees are subject to annual revision.

Complete if Known

Application Number

Filing Date

First Named Inventor



Examiner Name 



Group Art Unit 



To be assigned 



Rajesh GOKHALE 



To be assigned 



To be assigned 



TOTAL AMOUNT OF PAYMENT 



($) 388.00 



Attorney Docket No. 



METHOD OF PAYMENT 



FEE CALCULATION (continued) 



1. ☒ The Commissioner is hereby authorized to charge indicated
fees and credit any overpayments to:
Deposit Account Number: 03-1952



3. ADDITIONAL FEES 

Large Entity Small Entity 

Fee Fee Fee Fee 

Code ($) Code <$) 



Deposit Account Name: Morrison & Foerster LLP



FEE CALCULATION 



1. BASIC FILING FEE 



Charge Any Additional Fee Required Under 37 CFR 1.18 and 1.17
Applicant claims small entity status. See 37 CFR 1.27
2. □ Payment Enclosed:
□ Check □ Credit Card □ Money Order □ Other




Large Entity        Small Entity
Fee Code  Fee ($)   Fee Code  Fee ($)   Fee Description          Fee Paid
101       740       201       370       Utility filing fee       370
106       330       208       165       Design filing fee
107       510       207       255       Plant filing fee
108       740       208       370       Reissue filing fee
114       160       214       60        Provisional filing fee

SUBTOTAL (1) ($) 370.00



2. EXTRA CLAIM FEES 

22-20** 



Total Claims 



Fee from
below

18 



Claims 
Multiple Dependent 



-3- « 



Fee Paid

- $18 

- $ 
" $ 



105 
127 
139 
147 
112 

113 
115 
116 

117 

118 

128 
119 
120 
121 
138 

140 
141 

142 
143 

144 

122 
123 
126 



130 

50 

130 

2,520 

920* 

1,840* 

110 

400 



1.960 
320 
320 
280 

1.510 

110 
1.280 

1,260 
460 



130 



180 



205 
227 
139 
147 
112 

113 
215 
216 

217 

218 

228 
219 
220 
221 

138 

240 
241 

242 
243 

244 

122 

123 

126 



25 
130 
2,520 
920* 

1,840* 

55 

200 

460 

720 

980 
160 
160 
140 

1,510 

55 
640 

640 
230 

310 

130 

50 

160 



Fee Description        Fee Paid

Surcharge - late filing fee or oath
Surcharge - late provisional filing fee or cover sheet
Non-English specification
For filing a request for ex parte reexamination
Requesting publication of SIR prior to Examiner action
Requesting publication of SIR after Examiner action
Extension for reply within first month
Extension for reply within second month
Extension for reply within third month
Extension for reply within fourth month
Extension for reply within fifth month
Notice of Appeal
Filing a brief in support of an appeal
Request for oral hearing
Petition to institute a public use proceeding
Petition to revive - unavoidable
Petition to revive - unintentional
Utility issue fee (or reissue)
Design issue fee
Plant issue fee
Petitions to the Commissioner
Petitions related to provisional


Large Entity        Small Entity
Fee Code  Fee ($)   Fee Code  Fee ($)   Fee Description
581       40        581       40
103       18        203       9         Claims in excess of 20
146       740       246       370
102       84        202       42        Independent claims in excess of 3
149       740       249       370
104       280       204       140       Multiple dependent claims, if not paid
179       740       279       370
109       84        209       42        **Reissue independent claims over original patent
169       900       169       600
110       18        210       9         **Reissue claims in excess of 20 and over original patent











Submission of Information Disclosure Stmt
Recording each patent assignment per property (times number of properties)
Filing a submission after final rejection (37 CFR § 1.129(a))
For each additional invention to be examined (37 CFR § 1.129(b))
Request for Continued Examination (RCE)
Request for expedited examination of a design application

SUBTOTAL (2) ($) 18.00



Other fee (specify) 



* or number previously paid, if greater; for reissues, see above.

**Reduced by Basic Filing Fee Paid



SUBTOTAL (3) 



($) 



SUBMITTED BY 



Complete (if applicable) 



Name (Print/Type): Brenda J. Wallach

Registration No. (Attorney/Agent): 45,193

Telephone: (858) 720-7981

Signature

Date: March 4, 2002



WARNING: Information on this form may become public. Credit card information should not
be included on this form. Provide credit card information and authorization on PTO-2038.

Burden Hour Statement: This form is estimated to take 0.2 hours to complete. Time will vary depending upon the needs of the individual case. Any comments on the amount of time you are required to complete this form
should be sent to the Chief Information Officer, U.S. Patent and Trademark Office, Washington, DC 20231. DO NOT SEND FEES OR COMPLETED FORMS TO THIS ADDRESS. SEND TO: Commissioner for Patents,
Washington, DC 20231.

sd-75071 



Application Data Sheet 






Inventor Information

Inventor One Given Name: Rajesh
Family Name: Gokhale
Name Suffix:
Postal Address Line One:
Postal Address Line Two:
City: Delphi
State or Province: India
Postal or Zip Code:
Citizenship Country: India

Inventor One Given Name: Stuart
Family Name: Tsuji
Name Suffix:
Postal Address Line One:
Postal Address Line Two:
City: Stanford
State or Province: CA
Postal or Zip Code:
Citizenship Country: Unknown

Inventor One Given Name: Chaitan
Family Name: Khosla
Name Suffix:
Postal Address Line One:
Postal Address Line Two:
City: Palo Alto
State or Province: CA
Postal or Zip Code:
Citizenship Country: India

Correspondence Information

Name Line One: Brenda J. Wallach
Name Line Two: Morrison & Foerster LLP
Address Line One: 3811 Valley Center Drive
Address Line Two: Suite 500
City: San Diego
State or Province: California
Postal or Zip Code: 92130
Telephone: 858-720-7961
Fax: 858-720-5125
Electronic Mail:



Application Information

Title Line One: METHODS TO MEDIATE POLYKETIDE 

Title Line Two: SYNTHASE MODULE EFFECTIVENESS 

Total Drawing Sheets: 22 

Formal Drawings?: No 

Application Type: Utility 

Docket Number 30062-20046.20 

Representative Information 

Representative Customer Number 25225 

Continuity Information 

This application is a: Continuation-in-part

> Application One: S/N 09/500,747

Filing Date: February 9, 2000

which is a:

>> Application Two:
Filing Date:

which is a:

>>> Application Three:

Filing Date:

Prior Foreign Applications

Foreign Application One: 
Filing Date: 
Country: 
Priority Claimed: 



PATENT

Atty. Dkt: 300622004620

METHODS TO MEDIATE POLYKETIDE SYNTHASE
MODULE EFFECTIVENESS



Cross-Reference to Related Applications 



[0001] This patent application is a continuation-in-part application of copending U.S.
patent application 09/500,747, filed 9 February 2000; which claims the benefit of the 
filing date of U.S. Provisional Application No. 60/119,363, filed 9 February 1999. 
Further, this application claims the benefit of the filing date of U.S. Provisional 
Application Nos. 60/272,985 and 60/272,987, both filed 2 March 2001. Each of these 
applications is incorporated herein by reference. 

Statement of Rights to Inventions Made Under Federally Sponsored Research 
[0002] The invention herein was made, at least in part, based on support by grants 
CA-66736, GM-22172, and GM-22176 from the National Institutes of Health, and grant 
BES-9806774 from the National Science Foundation. The U.S. government may have 
certain rights in the invention. 

Technical Field 

[0003] The invention is directed to facilitating usage by polyketide synthase modules 
of nascent polyketide chains. Specifically, the invention concerns including intermodule 
and intramodule linkers in constructions for synthesis of desired polyketides. More 
specifically, the invention concerns the effects of protein-protein interactions and 
enzyme-substrate interactions in the channeling of intermediates between polyketide 
synthase modules. 

Background of the Invention 

[0004] Polyketides are a class of compounds synthesized from 2-carbon units through 
a series of condensations and subsequent modifications. Polyketides occur in many types 
of organisms, including fungi and mycelial bacteria, in particular, the actinomycetes. 
Polyketides are biologically active molecules with a wide variety of structures, and the 
class encompasses numerous compounds with diverse activities. Tetracycline, 
erythromycin, epothilone, FK-506, FK-520, narbomycin, picromycin, rapamycin,
spinosyn, and tylosin are examples of polyketides. Given the difficulty in producing
polyketide compounds by traditional chemical methodology, and the typically low 
production of polyketides in wild-type cells, there has been considerable interest in 
finding improved or alternate means to produce polyketide compounds. 
[0005] The biosynthetic diversity of polyketides is generated by repetitive 
condensations of simple monomers by polyketide synthase (PKS) enzymes that mimic 
fatty acid synthases. For instance, the deoxyerythronolide-B synthase catalyzes the chain 
extension of a primer with several methylmalonyl coenzyme A (MeMalCoA) extender 
units to produce the erythromycin core. 

[0006] The cloning, analysis, and recombinant DNA technology of genes that encode 
PKS enzymes allows one to manipulate a known PKS gene cluster either to produce the
polyketide synthesized by that PKS at higher levels than occur in nature or in hosts that 
otherwise do not produce the polyketide. The technology also allows one to produce 
molecules that are structurally related to, but distinct from, the polyketides produced from 
known PKS gene clusters. See, e.g., PCT publication Nos. WO 93/13663; 95/08548;
96/40968; 97/02358; 98/27203; and 98/49315; United States Patent Nos. 4,874,748;
5,063,155; 5,098,837; 5,149,639; 5,672,491; 5,712,146; 5,830,750; and 5,843,718; and
Fu, et al., 1994, Biochemistry 33: 9321-9326; McDaniel, et al., 1993, Science 262:
1546-1550; and Rohr, 1995, Angew. Chem. Int. Ed. Engl. 34(8): 881-888, each of which
is incorporated herein by reference. 

[0007] PKSs catalyze the biosynthesis of polyketides through repeated, 
decarboxylative Claisen condensations between acylthioester building blocks. The
building blocks used to form complex polyketides are typically acylthioesters, such as 
acetyl, butyryl, propionyl, malonyl, hydroxymalonyl, methylmalonyl, and ethylmalonyl 
CoA. Two major types of PKS enzymes are known; these differ in their composition and 
mode of synthesis of the polyketide synthesized. These two major types of PKS enzymes 
are commonly referred to as Type I or "modular" and Type II or "iterative" PKS enzymes.
[0008] The present invention concerns modular PKS. In the Type I or modular PKS 
enzyme group, a set of separate catalytic active sites (each active site is termed a 
"domain", and a set thereof is termed a "module") exists for each cycle of carbon chain
elongation and modification in the polyketide synthesis pathway. The typical modular 
PKS is composed of several large polypeptides, which can be segregated from amino to 
carboxy termini into a loading module, multiple extender modules, and a releasing (or
thioesterase) domain. The PKS enzyme known as 6-deoxyerythronolide B synthase 
(DEBS) is a typical Type I PKS. In DEBS, there is a loading module, six extender 
modules, and a thioesterase (TE) domain. The loading module, six extender modules, 
and TE of DEBS are present on three separate proteins (designated DEBS-1, DEBS-2, 
and DEBS-3, with two extender modules per protein). Each of the DEBS polypeptides is 
encoded by a separate open reading frame (ORF) or gene; these genes are known as 
eryAI, eryAII, and eryAIII. See Figure 1. There is considerable interest in the genetic and
chemical reprogramming of modular PKSs (see, e.g., Khosla, 1997, Chem. Rev. 97:2577-
2590, and Staunton, et al., 1997, Chem. Rev. 97:2611-2629, each of which is incorporated
herein by reference). 

[0009] Generally, the loading module is responsible for binding the first building
block used to synthesize the polyketide and transferring it to the first extender module.
The loading module of DEBS consists of an acyltransferase (AT) domain and an acyl
carrier protein (ACP) domain. Another type of loading module utilizes an inactivated
KS, an AT, and an ACP. This inactivated KS is in some instances called KS^Q, where the
superscript letter is the abbreviation for the amino acid, glutamine, that is present instead
of the active site cysteine required for ketosynthase activity. In other PKS enzymes,
including the FK-520 PKS, the loading module incorporates an unusual starter unit and is
composed of a CoA ligase activity domain. In any event, the loading module recognizes
a particular acyl-CoA (usually acetyl or propionyl but sometimes butyryl) and transfers it 
as a thiol ester to the ACP of the loading module. 

[0010] The AT on each of the extender modules recognizes a particular extender-
CoA (malonyl or alpha-substituted malonyl, i.e., methylmalonyl, ethylmalonyl, and
hydroxymalonyl) and transfers it to the ACP of that extender module to form a thioester. 
Each extender module is responsible for accepting a compound from a prior module, 
binding a building block, attaching the building block to the compound from the prior 
module, optionally performing one or more additional functions, and transferring the
resulting compound to the next module. The transfer into a module is mediated by the 
KS domain which is upstream of the remaining catalytic domains. The additional 
functions are performed by enzymes which comprise a ketoreductase (KR) which reduces 
the carbonyl group generated from the condensation to an alcohol, a dehydratase (DH) 
which converts the alcohol to a double bond, and an enoyl reductase (ER) which reduces 
the double bond to a single bond. These catalytic domains appear to be immediately 
adjacent and not separated by any linking sequences. Collectively, they can be called 
"beta-carbonyl modifying" domains. Thus, a particular module may contain none of 
these activities, only KR, or KR+DH, or KR+DH+ER. Thus, the order of domains from 
the N-terminus of a particular module is KS, AT, beta-carbonyl modifying domains (if 
present), ACP. The order, N→C, of the beta-carbonyl modifying enzymes is DH, ER, KR.
[0011] Thus, each extender module of a modular PKS contains zero, one, two, or 
three enzymes that modify the beta-carbon of the growing polyketide chain downstream 
of the AT catalytic domain. A typical (non-loading) minimal Type I PKS extender
module is exemplified by extender module 3 of DEBS, which contains only a KS 
domain, an AT domain, and an ACP domain. The next extender module, module 4, 
contains all three beta-carbonyl modifying enzymes. (The beta-carbonyl modifying 
enzymes effect such modification on the extender unit that has been added by the 
previous module.) 
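
The domain ordering described in the two preceding paragraphs can be sketched in code. The Python snippet below is an illustrative sketch only: the function name `extender_module_domains` and the string labels are assumptions introduced here and do not appear in the specification. It assembles the N-to-C domain order (KS, AT, any beta-carbonyl modifying domains in the order DH, ER, KR, then ACP) and accepts only the four combinations named in the text.

```python
# Illustrative sketch; all names are hypothetical, not from the specification.
ALLOWED_REDUCTIVE_SETS = [set(), {"KR"}, {"KR", "DH"}, {"KR", "DH", "ER"}]
REDUCTIVE_ORDER = ["DH", "ER", "KR"]  # N -> C order stated in the text


def extender_module_domains(reductive=()):
    """Return the N -> C domain order of an extender module."""
    chosen = set(reductive)
    if chosen not in ALLOWED_REDUCTIVE_SETS:
        raise ValueError(f"unsupported combination: {sorted(chosen)}")
    middle = [d for d in REDUCTIVE_ORDER if d in chosen]
    return ["KS", "AT", *middle, "ACP"]


# DEBS extender module 3 is described as minimal; module 4 carries all three
# beta-carbonyl modifying domains.
module_3 = extender_module_domains()
module_4 = extender_module_domains({"KR", "DH", "ER"})
print(module_3)
print(module_4)
```

Restricting the input to the four listed combinations mirrors the statement that a module contains none of these activities, only KR, KR+DH, or KR+DH+ER; other combinations are rejected by this sketch rather than guessed at.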

[0012] Once the PKS is primed with acyl- and malonyl-ACPs, the acyl group of the 
loading module migrates to form a thiol ester (trans-esterification) at the KS of the first 
extender module; at this stage, extender module one possesses an acyl-KS adjacent to a 
malonyl (or substituted malonyl) ACP. The acyl group derived from the loading module 
is then covalently attached to the alpha-carbon of the malonyl group to form a carbon- 
carbon bond, driven by concomitant decarboxylation, and generating a new acyl-ACP
that has a backbone two carbons longer than the loading building block (elongation or 
extension). 
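
The two-carbon growth per condensation can be expressed as simple arithmetic. The following Python sketch is illustrative only; the function name `backbone_carbons` is hypothetical, and the example values (a three-carbon propionyl starter and six extension cycles) are assumptions chosen to resemble DEBS rather than figures taken from this paragraph. It counts backbone carbons only and ignores side-chain carbons contributed by substituted malonyl extender units.

```python
def backbone_carbons(starter_carbons, extension_cycles):
    """Backbone carbon count after a number of two-carbon extension cycles."""
    if starter_carbons < 1 or extension_cycles < 0:
        raise ValueError("starter must have >= 1 carbon and cycles must be >= 0")
    # Each decarboxylative condensation lengthens the backbone by two carbons.
    return starter_carbons + 2 * extension_cycles


# Assumed example: propionyl starter (3 carbons), six extender modules.
example_length = backbone_carbons(3, 6)
print(example_length)
```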

[0013] After traversing the final extender module, the polyketide encounters a 
releasing domain that cleaves the polyketide from the PKS and typically cyclizes the 
polyketide. For example, final synthesis of 6-dEB is regulated by a TE domain located at
the end of extender module six. In the synthesis of 6-dEB, the TE domain catalyzes
cyclization of the macrolide ring by formation of an ester linkage. In FK-506, FK-520,
rapamycin, and similar polyketides, the ester linkage formed by the TE activity is
replaced by a linkage formed by incorporation of a pipecolate acid residue. The
enzymatic activity that catalyzes this incorporation for the rapamycin enzyme is known
as RapP, encoded by the rapP gene. The polyketide can be modified further by tailoring
enzymes; these enzymes add carbohydrate groups or methyl groups, or make other
modifications, i.e., oxidation or reduction, on the polyketide core molecule. For example,
6-dEB is hydroxylated at C6 and C12 and glycosylated at C3 and C5 in the synthesis of
erythromycin A.

[0014] In PKS polypeptides, the regions that encode enzymatic activities (domains)
are separated by linker or "scaffold"-encoding regions. These scaffold regions encode
amino acid sequences that space the domains at the appropriate distances and in the
correct order. Thus, the linker regions of a PKS protein collectively can be considered to
encode a scaffold into which the various domains (and thus modules) are placed in a
particular order and spatial arrangement. Generally, this organization permits PKS
catalytic domains of different or identical substrate specificities to be substituted (usually
at the DNA level) between PKS enzymes by various available methodologies. Thus,
there is considerable flexibility in the design of new PKS enzymes with the result that
known polyketides can be produced more effectively, and novel polyketides useful as
pharmaceuticals or for other purposes can be made.

[0015] PCT publication WO 98/49315, the contents of which are incorporated herein
by reference, describes an approach for modifying the enzymatic activities included
within modules of a PKS by maintaining the scaffolding intact but replacing catalytic
domains with different catalytic domains. U.S. Serial No. 09/346,860 filed 2 July 1999
and the corresponding PCT publication WO 00/01838, also filed on that date, and
incorporated herein by reference, describe alternative methods by altering the
hypervariable region of the AT domains so as to alter the specificity for an extender unit
and alteration of the KS domains to control stereochemistry. The present invention takes
advantage of the approach of manipulating modules so that the catalytic activities of an
entire module are placed in the appropriate sequence to construct a desired polyketide. 
The ability to utilize this approach depends on effecting an appropriate means for the 
module to incorporate a growing polyketide chain, which involves assuring that an 
appropriate linker region is included. Since the filing of the provisional application from
which the present application claims priority, a related paper has been published by
Ranganathan, A., et al., Chem. & Biol. (1999) 6:731-741. In this paper, intrapolypeptide
linkages are fortuitously supplied to chimeric modules by including the KS region of the 
native downstream module in a chimera between the corresponding upstream module and 
the portions downstream of the KS domain in a heterologous module. Alternatively, the 
downstream module will include the ACP catalytic domain of the native upstream 
module fused to the remainder of a heterologous module upstream in the chimera. 

Background Information 

[0016] The following articles provide information relating to the invention: Aparicio, 
J. F., et al., (1996) Gene 169, 9-16; Cortes, J., et al., (1990) Nature 348, 176-178;
Donadio, S., et al., (1991) Science 252, 675-679; Gokhale, R. S., et al., (2000) Curr.
Opin. Chem. Biol. 4, 22-27.

Abbreviations 

[0017] 6-dEB: 6-deoxyerythronolide B; ACP: acyl carrier protein; AT:
acyltransferase; DEBS: 6-deoxyerythronolide B synthase; DH: dehydratase; ER:
enoylreductase; KR: ketoreductase; KS: ketosynthase; NAC: N-acetylcysteamine;
NRPS: nonribosomal peptide synthetase; PCP: peptidyl carrier protein; PKS: polyketide
synthase; LDD: loading didomain; TE:
thioesterase; M2: module 2 of DEBS; M2(4): module 2 with C-terminal linker from
module 4; M3+TE: module 3 fused to thioesterase; (5)M3+TE: module 3 with N-
terminal linker from module 5; M2:M3: complex of module 2 and module 3; and NDK:
(2S,3R)-2-methyl-3-hydroxypentanoic acid diketide.

Disclosure of the Invention



[0018] The invention is directed to an efficient method for constructing an arbitrarily 
chosen polyketide synthase, and therefore a desired polyketide, by manipulating entire 
modules of Type I polyketide synthases. The invention enables this approach by 
providing the modules with the appropriate "lead-in" or linker sequence to the
ketosynthase (KS). Applicants have discovered that the appropriate linker between
modules is required upstream of the relevant KS in order to permit the module to accept
the nascent polyketide chain, and, in the case of intermolecular transfer, appropriate
pairing of N-terminal and C-terminal regions assures the appropriate transfer. The nature
of this linker varies depending on whether the module is covalently linked downstream
from another module, or whether it forms the N-terminus of the polypeptide.

[0019] Thus, in one aspect, the invention is directed to a method to construct a
functional polyketide synthase which method comprises providing each module
contained in the desired polyketide synthase with an appropriate intrapolypeptide linker
(RAL) when said module is downstream in the same polypeptide from a module derived
from a different PKS and with an appropriate interpolypeptide linker (ERL) when the
module is derived from a PKS where the module is the N-terminal module of a
polypeptide. If the module at the N-terminus of a polypeptide is to accept a nascent
polyketide chain from an upstream module, the interpolypeptide linker needs to include
the appropriate amino acid sequence at the C-terminus of the module donating the
nascent chain.
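
The linker rule of this paragraph can be sketched as a small planning routine. The Python code below is a simplified illustration, not the claimed method: the function name `assign_linkers`, the dictionary keys, and the module labels are hypothetical. As a simplifying assumption, every module that sits downstream of another module in the same polypeptide is labeled RAL, without checking whether the upstream module derives from a different PKS.

```python
from typing import Dict, List, Optional


def assign_linkers(polypeptides: List[List[str]]) -> List[Dict[str, Optional[str]]]:
    """Label the linker type placed upstream of each module's KS domain.

    Each inner list is one polypeptide, with module names in N -> C order.
    """
    plan: List[Dict[str, Optional[str]]] = []
    donor: Optional[str] = None  # C-terminal module of the preceding polypeptide
    for chain in polypeptides:
        if not chain:
            continue  # empty polypeptides are ignored in this sketch
        for position, module in enumerate(chain):
            if position == 0:
                # N-terminal module: interpolypeptide linker (ERL), paired
                # with the C-terminus of the module donating the chain.
                plan.append({"module": module, "linker": "ERL", "donor": donor})
            else:
                # Covalently linked downstream of another module:
                # intrapolypeptide linker (RAL).
                plan.append({"module": module, "linker": "RAL", "donor": None})
        donor = chain[-1]
    return plan


# Hypothetical arrangement: two modules on one polypeptide, one on the next.
plan = assign_linkers([["M1", "M2"], ["M3"]])
for entry in plan:
    print(entry)
```

Recording the donor module alongside each ERL entry reflects the requirement that the N-terminal linker be matched to the C-terminal sequence of the module that hands over the nascent chain.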

[0020] In describing a "module" being provided with linker(s), the term "module"
refers to the functional portions extending approximately from the N-terminus of the KS
catalytic region to the C-terminus of the ACP, i.e., excludes the linker portions otherwise
considered a portion of the module.

[0021] As further described below, any order of modules of desired specificity can be 
assured by providing the appropriate linkers either intermolecularly or intramolecularly. 
Thus, the polyketide synthase can be assembled from individual modules by providing 
the appropriate linkers to assure that the polyketide chain will be passed in the correct 

sequence from one module to the next and by assembling these modules either by directly
providing the polypeptides containing them or by co-expressing nucleotide sequences
encoding them in a host cell.

[0022] In other aspects, the invention is directed to materials and compositions useful 
in carrying out the method, in particular to isolated DNA fragments which contain the 
appropriate intrapolypeptide and interpolypeptide linkers. The invention also relates to 
methods to construct functional polyketide synthases from libraries of modules and to 
polyketides prepared by supplying appropriate substrates to reconstructed polyketide 
synthases. The polyketides thus prepared can be "tailored" using either isolated enzymes 
or feeding the polyketides to an organism containing these enzymes to convert them to 
anti-infectives or compounds of other activities such as motilides by such post-
polyketide modifications as hydroxylation and glycosylation. The ketides or ketolides or
their modified forms can also be further derivatized using chemical synthetic methods. 
[0023] In other aspects, the invention is directed towards the C- and N-terminal ends
of adjacent PKS polypeptides capped by peptides of 20-40 residues. Mismatched 
sequences abolish intermodular chain transfer without affecting the activity of individual 
modules, whereas matched sequences can facilitate the channeling of intermediates 
between ordinarily non-consecutive modules. 

[0024] In yet another aspect, the invention is directed towards the role of protein- 
protein interactions in substrate channeling and more specifically to assays or methods to 
assess the steady-state kinetic parameters of individual DEBS modules when primed in a 
channeling mode versus a diffusive mode. The diffusive process precludes the
involvement of the covalent, substrate channeling mechanism by which enzyme-bound 
intermediates are directly transferred from one module to the next in a multi-modular 
PKS. These methods can be used to quantify the kinetic benefit of linker-mediated 
substrate channeling in a modular PKS. 
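
One way such a comparison might be quantified is through the Michaelis-Menten rate law and the specificity constant kcat/KM. The Python sketch below is illustrative only: the function names are hypothetical, the enzyme concentration default is an assumption, and the parameter values are placeholders chosen for easy arithmetic, not measured data from this application.

```python
def michaelis_menten_rate(substrate_mM, kcat_per_min, km_mM, enzyme_uM=1.0):
    """Initial velocity (uM/min): v = kcat * [E] * [S] / (KM + [S])."""
    if substrate_mM < 0 or km_mM <= 0 or enzyme_uM < 0:
        raise ValueError("concentrations must be non-negative and KM positive")
    return kcat_per_min * enzyme_uM * substrate_mM / (km_mM + substrate_mM)


def specificity_constant(kcat_per_min, km_mM):
    """kcat/KM in min^-1 mM^-1."""
    if km_mM <= 0:
        raise ValueError("KM must be positive")
    return kcat_per_min / km_mM


def channeling_benefit(channeled, diffusive):
    """Fold change in kcat/KM between two (kcat, KM) parameter pairs."""
    return specificity_constant(*channeled) / specificity_constant(*diffusive)


# Placeholder parameters for illustration only; NOT measured values.
channeled_params = (2.0, 0.5)  # (kcat in min^-1, KM in mM)
diffusive_params = (2.0, 4.0)

half_max_rate = michaelis_menten_rate(0.5, *channeled_params)  # [S] equals KM
benefit = channeling_benefit(channeled_params, diffusive_params)
print(half_max_rate)
print(benefit)
```

Comparing kcat/KM rather than kcat alone captures differences that arise from substrate binding as well as turnover, which is why it is used here as the summary measure.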

[0025] In another aspect, the invention is directed towards the ability of a synthetic 
peptide to inhibit tetraketide production. For example, a peptide corresponding to the N-
terminal linker of module 3 was synthesized and shown to inhibit the formation of
tetraketide lactone 2 (as shown in Figure 6) in the presence of M2 and M3+TE in a 
concentration-dependent manner. 

Brief Description of the Drawings

[0026] Figure 1 is a diagram of erythromycin PKS which forms 6-dEB. 
[0027] Figure 2 shows the conversion of a diketide thioester to the triketides
corresponding to those produced by DEBS-1 of the erythromycin PKS.
[0028] Figure 3 shows sequences of intrapolypeptide and interpolypeptide linkers
derived from various Type I PKS. 

[0029] Figure 4 is a schematic diagram of the biosynthesis of 6-dEB by 6- 
deoxyerythronolide B synthase. 

[0030] Figure 5 presents a velocity vs. NDK (mM) plot showing the fit of the 
Michaelis-Menten equation for M3 + TE and (5)M3 + TE alone. 

[0031] Figure 6 is a schematic diagram of the interpolypeptide transfer with matched 
and mismatched linker pairs. 

[0032] Figure 7 provides saturation curves of (A) M2 with M3 + TE and (B) M2(4)
with (5)M3 + TE.

[0033] Figure 8 shows selective inhibition by a synthetic peptide mimic of the N- 
terminal linker of M3. 

[0034] Figure 9 provides the CD spectrum of a peptide mimic showing the minima at 
208 and 222 nm indicative of α-helical character.

[0035] Figure 10 presents schematic illustrations of three mechanisms of loading a 
DEBS module with a diketide.

[0036] Figure 11 illustrates four diketides and their corresponding, putative
enzymatic products. 

[0037] Figure 12 presents the reaction schemes of the three bimodular DEBS
derivatives, M1+M5+TE (module 1 + module 5 + TE), M1+M6+TE, and M1+M2+TE
(DEBS1+TE), and their corresponding kcat values.

[0038] Figure 13 presents a comparison of the kcat/KM values (min⁻¹ mM⁻¹) of the
two syn-diketides when presented as acyl-ACP substrates (4a and 4b) vs. when presented
as NAC-thioesters (2a and 2b).

[0039] Figure 14 presents a comparison of the kcat values (min⁻¹) of the four diketides
when presented as acyl-ACP substrates (4a-d) vs. when presented as NAC-thioesters (2a-d).

[0040] Figure 15 presents conditions, results, and proposed mechanisms of
back-transfer experiments.

[0041] Figure 16 is a schematic of 6-deoxyerythronolide B synthase.

[0042] Figure 17 is a schematic of matched and mismatched linker pairs at
intermodular interfaces.

[0043] Figure 18 is a schematic illustrating the channeling of intermediates to
unnatural recipient modules.

[0044] Figure 19 is a schematic illustrating the replacement of the entire donor
module with just the acyl carrier protein.

[0045] Figure 20 is a table summarizing the substrates and enzymes used to study
substrate transfer from a donor to a recipient module.

[0046] Figure 21 shows data supporting a coiled-coil conformation for linkers.

[0047] Figure 22 is a schematic illustrating how exogenous peptide mimetics of
linkers can inhibit chain transfer.

Detailed Description of the Drawings

[0048] Figure 1 is a diagram of the erythromycin PKS which forms 6-dEB, the core
precursor of erythromycin. As shown, the PKS is comprised of three proteins, DEBS-1,
DEBS-2 and DEBS-3, which are encoded by three genes, commonly called eryAI, eryAII
and eryAIII.

[0049] Figure 2 shows the conversion of a diketide thioester to the triketides
corresponding to those produced by DEBS-1 of the erythromycin PKS.

[0050] Figure 3 shows the structures of intrapolypeptide linkers and the N-terminal
portions of interpolypeptide linkers (SEQ ID NOS:3-19) derived from various Type I
PKS.

[0051] Figure 4 is a schematic diagram of the biosynthesis of 6-dEB by
6-deoxyerythronolide B synthase. Each polypeptide, DEBS1, DEBS2, and DEBS3,
contains two modules, and each module comprises a set of active site domains
responsible for addition and modification of an extender unit. The short "linker" regions
are located at the N- and C-termini; their shapes exemplify the complementarity
demonstrated by each pair.

[0052] Figure 5 presents a velocity vs. NDK (mM) plot showing the fit of the
Michaelis-Menten equation for M3 + TE and (5)M3 + TE alone. The nearly identical kcat
values of 0.71 and 0.68 min⁻¹ and KM values of 2.8 and 2.5pM demonstrate the
interchangeability of the linkers for individual modules.
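
The Michaelis-Menten fit referred to for Figure 5 can be sketched as follows. This is a minimal illustration, not the fitting procedure used to generate the figure. It assumes that the quoted kcat and KM values pair with the two constructs in the order listed, and that KM is expressed in the same concentration unit as the substrate axis; the function and dictionary names are hypothetical.

```python
def michaelis_menten_rate(substrate: float, kcat: float, km: float) -> float:
    """Return the turnover rate v/[E] = kcat * S / (KM + S).

    substrate and km must share one concentration unit; kcat is in min^-1.
    """
    if substrate < 0 or km <= 0:
        raise ValueError("substrate must be >= 0 and km must be > 0")
    return kcat * substrate / (km + substrate)


# Values quoted for Figure 5; pairing them with the two constructs in the
# order listed is an assumption.
CONSTRUCTS = {
    "M3+TE": {"kcat": 0.71, "km": 2.8},
    "(5)M3+TE": {"kcat": 0.68, "km": 2.5},
}

if __name__ == "__main__":
    for name, p in CONSTRUCTS.items():
        rates = [michaelis_menten_rate(s, p["kcat"], p["km"]) for s in (1.0, 2.5, 10.0)]
        print(name, [round(r, 3) for r in rates])
```

Because the two parameter sets are close, the two computed curves nearly overlap across the substrate range, which is the sense in which the linkers are described as interchangeable for an individual module.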

[0053] Figure 6 presents a schematic diagram of the interpolypeptide transfer with
matched and mismatched linker pairs: (A) wild-type linker pair, M2 C-terminus + M3
N-terminus; (B) mismatched pair, M2 C-terminus + M5 N-terminus; (C) mismatched pair,
M4 C-terminus + M3 N-terminus; (D) matched pair, M4 C-terminus + M5 N-terminus.

[0054] Figure 7 provides saturation curves of (A) M2 with M3 + TE and (B) M2(4)
with (5)M3 + TE showing the effect of increasing M3 concentration on the overall rate of
turnover. From these plots, the saturating rates of 0.27 and 0.74 min⁻¹ were determined,
as were the KD values of 1.1 and 2.1.

[0055] Figure 8 shows selective inhibition by the synthetic peptide mimic of the N- 
terminal linker of M3. The linker lowered the overall rate of tetraketide production in a 
dose-dependent fashion for the transfer using the M2-M3 linker pair. However, it 
demonstrated no such effect when tetraketide production depended upon interpolypeptide 
transfer using the M4-M5 linker pair. 

[0056] Figure 9 provides the CD spectrum of the peptide mimic showing the minima 
at 208 and 222 nm indicative of α-helical character. Though only ca. 50% helical, its
structural features correlate with the expected coiled-coil motif. Furthermore, its ability 
to selectively inhibit interpolypeptide transfer verified some recognition ability of the 
mimic. 

[0057] Figure 10 presents schematic illustrations of the three mechanisms of loading 
a DEBS module with a diketide. (A) In a diffusive mechanism, diketides that have been 
activated as N-acetylcysteamine thioesters (diastereomers 2a-d) are loaded exogenously 
onto the KS domain. Claisen-like condensation with a C3-unit derived from
methylmalonyl CoA followed by NADPH-dependent reduction gives the corresponding
triketide lactone products (3a-d). (B) In an intrapolypeptide channeling mechanism, a
diketide that is generated by module 1 from propionyl CoA, methylmalonyl CoA, and
NADPH is passed intramolecularly from ACP1 to KS2. Subsequent elongation and
reduction afford the triketide lactone 3a. (C) In an interpolypeptide channeling
mechanism, diketides that have been chemoenzymatically attached to ACP4 by Sfp
(4a-d) are transferred to the KS domain on a separate polypeptide. Elongation and reduction
afford the corresponding triketide lactones (3a-d). In all cases suffix "a" refers to the
(2S,3R) diastereomer, suffix "b" refers to the (2R,3S) diastereomer, suffix "c" refers to
the (2S,3S) diastereomer, and suffix "d" refers to the (2R,3R) diastereomer. See also
Figure 11.

[0058] Figure 11 illustrates the four diketides and their corresponding, putative
enzymatic products. See also caption to Figure 10.

[0059] Figure 12 presents the reaction schemes of the three bimodular DEBS
derivatives, M1+M5+TE (module 1 + module 5 + TE), M1+M6+TE, and M1+M2+TE
(DEBS1+TE), and their corresponding kcat values.

[0060] Figure 13 presents a comparison of the kcat/KM values (min⁻¹ mM⁻¹) of the
two syn-diketides when presented as acyl-ACP substrates (4a and 4b) vs. when presented
as NAC-thioesters (2a and 2b). The kcat/KM values for the NAC-thioesters were reported
earlier by Wu, N., et al., J. Am. Chem. Soc. 2000, 122, 4847-4852.

[0061] Figure 14 presents a comparison of the kcat values (min⁻¹) of the four diketides
when presented as acyl-ACP substrates (4a-d) vs. when presented as NAC-thioesters
(2a-d). "N.D." denotes that the product was not detected.

[0062] Figure 15 presents the following: (A) X-ray film image of SDS-PAGE gel and 
associated conditions of back-transfer experiments; and (B) proposed mechanism for 
back transfer of an exogenously loaded diketide from the KS of a formally downstream 
module to an upstream ACP. 

[0063] Figure 16 is a schematic of 6-deoxyerythronolide B synthase. Intermodular
pairs are shown in color. LD: loading domain; AT: acyltransferase; ACP: acyl carrier
protein; KS: ketosynthase; KR: ketoreductase; DH: dehydratase; ER: enoylreductase; and
TE: thioesterase.

[0064] Figure 17 is a schematic of matched and mismatched linker pairs at 
intermodular interfaces. Figure 17 shows how linker pairs at intermodular interfaces are 
selective, yet interchangeable. The left hand panel shows how matched linker pairs 
(triangular/triangular or square/square) facilitate efficient chain transfer. The right hand 
panel shows how mismatched linker pairs (triangular/square or square/triangular) abolish 
chain transfer without affecting the activity of individual modules. 
[0065] Figure 18 is a schematic illustrating the channeling of intermediates to
unnatural recipient modules. Linker pairs can channel intermediates to unnatural 
recipient modules. The left hand panel shows that in addition to communicating with its 
natural partner, module 3 (as shown in Figure 17), module 2 can also communicate with 
modules 5 or 6 as long as linker pairs are matched. The right hand panel shows that such 
communication is totally abolished if unmatched linker pairs are used. 
[0066] Figure 19 is a schematic illustrating the replacement of the entire donor 
module with just the acyl carrier protein. The properties of linkers as shown in Figures 
17 and 18 are maintained when the entire donor module is replaced with just the acyl
carrier protein. 

[0067] Figure 20 is a table summarizing the substrates and enzymes used to study
substrate transfer from a donor to a recipient module. Figure 20 shows how matched
linker pairs are able to efficiently transfer otherwise poor substrates from a donor to a
recipient module. Three different diastereomers of the natural substrate of module 2 were
presented to either module 2 or module 5. These substrates were presented as
N-acetylcysteamine (NAC) thioesters or on an ACP carrying a matched linker. In every
case chain transfer via protein-protein interactions proved to be a useful method for
substrate delivery. Products A, B, and C were produced at lower rates than the other
products.

[0068] Figure 21 shows data supporting a coiled-coil conformation for linkers. 
Circular dichroism and ultracentrifugation on synthetic peptides indicate a coiled-coil
conformation for linkers. The top panel shows circular dichroism (millidegrees) versus
wavelength (nm) for Mix, M2C, and FullM3N. The bottom left panel shows lnA versus
radius² (mm²) for linkers at module 2 C-terminus. The bottom right panel shows lnA
versus radius² (mm²) for linkers at module 3 N-terminus.

[0069] Figure 22 is a schematic illustrating how exogenous peptide mimetics of 
linkers can inhibit chain transfer. This figure suggests that the linkers can adopt a 
functional, native conformation. The top panel shows a natural transfer and the bottom 
panel shows how the addition of an exogenous peptide inhibits the transfer. 

[0070] Other features and advantages of the invention will be apparent from the 
following detailed description, and from the claims. 

Modes of Carrying Out the Invention 

[0071] The invention takes advantage of the identification of the amino acid 
sequences for supplying an appropriate linker between modules of a Type I PKS 
depending on the position of the module in the synthetic scheme for the polyketide. If 
the module is at the N-terminus of the polypeptide in which it resides (i.e., there is no
additional module covalently bound upstream of it), an "interpolypeptide linker" (ERL) is
placed upstream of the KS catalytic domain. Conversely, if the module resides in a
polypeptide wherein there is an additional module upstream of it and covalently linked to
it as a fusion protein, the two modules should be separated by an "intrapolypeptide
linker" (RAL). If the module residing at the N-terminus of a polypeptide is downstream
in the synthesis process for a polyketide (i.e., if it must accept a nascent polyketide
chain from a different module not on the same molecule), it may be necessary as well to
supply a portion of the interpolypeptide linker at the C-terminus of the module providing 
the nascent polyketide chain in order to assure orderly transfer. 

[0072] In the discussion that follows, polyketide synthases are discussed either at the 
protein level or the DNA level. As is well understood, manipulation of the sequence of
amino acids in the polyketide synthase proteins is most conveniently done using
recombinant techniques. Thus, for example, the appropriate linker sequences can be
introduced to or modified with respect to those of an existing module by modifying the
appropriate gene and expressing it in a suitable host. Interchange of linkers is also
conveniently done in this manner. Further, modifications of amino acid sequences so as
to obtain "variants" are effected by mutating the gene. The referent polyketide synthase
should be understood to exist at both the protein level and nucleic acid level, and which
form is being discussed should be apparent from the context.

[0073] Further, the action of polyketide synthases on their appropriate substrates can
be effected either extracellularly by using isolated enzymes or may be effected by 
producing the enzymes intracellularly. By "appropriate substrate" is meant the extender 
units in their thioester forms that are recognized by the various modules in the PKS and 
"starter" units which are either thioesters of carboxylic acids or partially synthesized 
polyketides such as diketides. For example, as described in PCT application 
PCT/US96/11317, the ketosynthase domain of module 1 may conveniently be inactivated
thus making more efficient the utilization of the diketide by module 2. 
[0074] The linkers can be supplied by conventional recombinant DNA manipulations
through the use of restriction enzymes and ligation procedures commonly practiced. The
linkers in the PKS of the invention will be "isolated" from their natural environments.
By "isolated," as used herein, is meant simply that the referent is found linked in
association with moieties with which it is not normally associated, or in an environment
in which it is not naturally found. It may be linked, if a nucleotide sequence, to additional
sequence with which it is not normally linked, or, if a peptide, to additional amino acid 
sequence with which it is not ordinarily linked, or it may be simply detached from 
additional moieties with which it is usually associated. 

[0075] As seen from Figure 3, the intrapeptide linkers (RAL) of the invention contain
approximately 16-20 amino acids and typically contain a proline residue at approximately
the middle of the sequence. On the other hand, the N-terminal upstream interpolypeptide
linkers are approximately twice as long and appear to contain conserved acidic amino
acid residues and basic amino acid residues at positions in the upstream half of the
molecule. Thus, a typical N-terminal upstream interpolypeptide linker (ERL) will contain
an acidic amino acid within the first 3-10 residues, which is followed after 8-10 residues
by a basic amino acid, and then after another 2-5 amino acid residues by an additional
acidic amino acid. Additional acidic and basic residues may also occur in these linkers.
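
The residue pattern described for a typical N-terminal interpolypeptide linker can be checked against a candidate sequence with a short script. The sketch below is one possible reading of that description, not a definition taken from this application. It assumes that "within the first 3-10 residues" means positions 3 through 10, that "after n residues" counts n intervening residues, that acidic residues are D and E, and that basic residues are K and R. The function name and the test sequence are hypothetical.

```python
ACIDIC = set("DE")  # assumption: aspartate and glutamate
BASIC = set("KR")   # assumption: lysine and arginine (histidine excluded)


def matches_n_erl_pattern(seq: str) -> bool:
    """Return True if seq shows the acidic ... basic ... acidic spacing pattern."""
    seq = seq.upper()
    n = len(seq)
    # Positions 3..10 (1-based) correspond to indices 2..9 (0-based).
    for i in range(2, min(10, n)):
        if seq[i] not in ACIDIC:
            continue
        for gap1 in range(8, 11):      # 8-10 intervening residues
            j = i + gap1 + 1
            if j >= n or seq[j] not in BASIC:
                continue
            for gap2 in range(2, 6):   # 2-5 intervening residues
                k = j + gap2 + 1
                if k < n and seq[k] in ACIDIC:
                    return True
    return False


if __name__ == "__main__":
    # Synthetic example built to satisfy the pattern; not a sequence from Figure 3.
    example = "AAE" + "A" * 8 + "K" + "AA" + "D" + "AAAA"
    print(matches_n_erl_pattern(example))
```

A stricter or looser reading of the spacing (for example, counting the flanking residues in each interval) would only require changing the two `range` bounds.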
[0076] The intrapeptide linkers or interpeptide linkers shown in Figure 3 can be used
as described below in the present invention or the corresponding amino acid sequences
from native Type I PKS in general can be employed. In addition to the sequences that
occur in nature, "variants" may be used. These variants are obtained by altering the
amino acid sequence of the linker in minor ways that do not affect the ability of the linker
to "feed" the nascent polyketide chain to the module in question. Typically, such
"variants" are obtained from the native sequences by amino acid substitution, deletion or
insertion; preferably the substitutions are "conservative" substitutions, i.e., an acidic
amino acid for a different acidic amino acid, a basic amino acid for a different basic
[...] alterations, preferably only two, and more preferably only one.

[0077] For construction of polyketide synthases which contain more than one
polypeptide, the appropriate sequence of transfer can be assured by matching the appropriate
C-terminal amino acid sequence of the donating module with the appropriate N-terminal
amino acid sequence of the interpolypeptide linker of the accepting module. This can
readily be done, for example, by selecting such pairs as they occur in native PKS. For
example, two arbitrarily selected modules could be coupled using the C-terminal portion
of module 4 of DEBS and the N-terminal portion of the linking sequence for module 5
of DEBS.

[0078] In general, the method of the invention involves supplying a module used in
a PKS for synthesis of a desired polyketide with the appropriate N-terminal upstream
portion of an interpolypeptide linker (N-ERL), C-terminal downstream portion of an
interpolypeptide linker (C-ERL) or with an intrapolypeptide linker (RAL) at either
terminus. As stated above, if the module is at the N-terminal portion of a polypeptide, an
N-terminal upstream interpolypeptide linker should be appended at its N-terminus. If the
module resides in a polypeptide where there is an additional module fused upstream from
it, the two modules should be separated by an intrapolypeptide linker.

[0079] For ease of construction, a library of functional modules can be maintained to
provide the appropriate desired module for construction of the PKS. One way to ensure 
the appropriate sequence of polyketide chain growth is to link the modules covalently, so 
that all but the first module will contain upstream intrapolypeptide linkers. Alternatively, 
and preferably, appropriate communication between functional modules non-covalently 
associated on separate polypeptide molecules can be achieved by providing appropriate 
matching between the C-terminal downstream portion of the interpolypeptide linker 
associated with the module contributing the nascent polyketide chain and the N-terminal 
upstream portion of the interpolypeptide linker placed upstream of the module which 
accepts and extends this nascent polyketide. Thus, an appropriate linker to ensure that 
the growing polyketide chain will be passed from module A to module B, which modules 
are not covalently bound, would be to couple, for example, the C-terminal scaffold 
portion of module 4 from erythromycin to module A and the N-terminal interpolypeptide 
linker (scaffold) portion from module 5 of the erythromycin PKS to the N-terminus of the 
KS of module B. 

[0080] To design and construct the PKS, one straightforward approach is to utilize 
the existing linker regions of a native PKS, such as erythromycin PKS, and simply to 
"plug in" modules, for example from a library. 

[0081] A library of modules derived from naturally occurring PKS which contains 
modules incorporating all alternative extender units used in native PKS combined with all 
variants of beta-carbonyl modification is not large. Extender units that are incorporated 
naturally include malonyl-CoA, methylmalonyl-CoA, ethylmalonyl-CoA, and 
hydroxymalonyl-CoA. The appropriate native molecule for incorporation of each of 
these can readily be found. Methylmalonyl-CoA extender units are incorporated, for 
example, by the modules of the erythromycin PKS. Certain modules of the picromycin 
PKS incorporate malonyl-CoA, while modules of the epothilone PKS incorporate 
ethylmalonyl-CoA or hydroxymalonyl-CoA. Modules occur naturally which contain the 
full spectrum of beta-carbonyl modifying activities; to the extent it is desirable to couple 
a particular beta-carbonyl modifying activity with a particular extender specificity, this 
can be accomplished by altering catalytic domains, per se, as described in the
above-referenced PCT publication WO 98/49315. The complete combination of extender unit
choices with all beta-carbonyl modification choices is thus only a total of 4x4 or
16 modules. As the KS unit determines the stereoselectivity of the module,
accommodation can be made for various stereoisomeric forms of precursor by adjusting
the KS domain in the module library. This expands the total number of modules
necessary only to 32. An arbitrary number of modules can be included in a particular
PKS construct, thus also determining the length of the polyketide chain and the size of
the macrolide product. Of course, the macrolide product can be modified, if desired, by
the known tailoring enzymes which convert naturally occurring macrolides to
hydroxylated and/or glycosylated forms and the like. Such modification can be achieved
in a variety of ways: by chemical modification, by in vitro treatment with appropriate
enzymes, or by feeding the polyketides to a host organism which contains the appropriate
tailoring enzymes, as is well understood in the art.
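
The library-size arithmetic above (4 x 4 = 16, doubled to 32 by the KS choice) can be reproduced by enumerating the combinations. In this sketch the extender units are those named in the text; the labels for the four beta-carbonyl outcomes and for the two KS stereochemical variants are illustrative assumptions, since the text gives only their counts.

```python
from itertools import product

EXTENDER_UNITS = (
    "malonyl-CoA",
    "methylmalonyl-CoA",
    "ethylmalonyl-CoA",
    "hydroxymalonyl-CoA",
)
# Assumed labels for the four beta-carbonyl processing outcomes.
BETA_CARBONYL_STATES = ("keto", "hydroxyl", "alkene", "alkane")
# Assumed placeholder labels for two KS stereoselectivity variants.
KS_VARIANTS = ("KS-variant-1", "KS-variant-2")


def module_library(include_ks_variants: bool = False) -> list:
    """Enumerate every extender/modification (and optionally KS) combination."""
    if include_ks_variants:
        return list(product(EXTENDER_UNITS, BETA_CARBONYL_STATES, KS_VARIANTS))
    return list(product(EXTENDER_UNITS, BETA_CARBONYL_STATES))


if __name__ == "__main__":
    print(len(module_library()))
    print(len(module_library(include_ks_variants=True)))
```

Each tuple in the returned list stands for one library module, so a PKS design can be written as an ordered selection of tuples from this list.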

[0082] To construct the desired PKS, modules are selected from the library and
provided with the appropriate upstream intrapolypeptide or interpolypeptide linkers. Suitable
linkers can be selected from the group consisting of those shown in Figure 3, the
corresponding sequences from any Type I PKS, or can include variants of these depicted
sequences which are conservative in nature and do not interfere with the ability of the
linker to permit effective uptake of the nascent polyketide chain. The linkers can be
added by standard recombinant techniques to the modules in the library, or, the library 
can be composed of the collection of modules wherein each module has been further 
manipulated to include either an intrapolypeptide or interpolypeptide linker. It may be 
desirable, for example, to provide each type of extender module with an intrapolypeptide 
linker, including the possibility of retaining the linker that is normally associated with it. 
If the linker is placed at the N-terminus of the module, the module is suitable for covalent 
linking downstream of an additional module in a single polypeptide. If the 
intrapolypeptide linker is at the C-terminus, ordinarily that module will be placed and 
linked covalently upstream of an additional module. In any case, a module may
arbitrarily be provided with an intrapolypeptide linker (RAL) at either its N- or
C-terminus depending on where it is ultimately to be placed in the PKS to be constructed
or may be provided with the N-terminal upstream portion of an interpolypeptide linker
(N-ERL) if it is to be placed at the N-terminus of a polypeptide in the PKS, or with a
C-terminal portion of an interpolypeptide linker (C-ERL) if it is to be placed at the
C-terminus of a polypeptide and is intended to transfer a nascent polyketide chain to a
subsequent module.
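
The placement rules for RAL, N-ERL and C-ERL linkers can be summarized as a small planning routine. This is a simplified sketch of the rules as stated in the text, not a construction protocol. The function name and the data layout (a list of polypeptides, each a list of module names in synthesis order) are hypothetical, and the routine applies the N-terminal rule uniformly, including to the first module of the first polypeptide.

```python
from typing import List, Optional, Tuple


def assign_linkers(polypeptides: List[List[str]]) -> List[Tuple[str, str, Optional[str]]]:
    """Return (module, N-terminal linker, C-terminal linker) for each module.

    A RAL reported at the C-terminus of one module and at the N-terminus of
    the next module on the same polypeptide is the same shared linker.
    """
    plan = []
    last_polypeptide = len(polypeptides) - 1
    for p_idx, modules in enumerate(polypeptides):
        for m_idx, name in enumerate(modules):
            # A fused upstream module means an intrapolypeptide linker (RAL);
            # otherwise the module sits at the N-terminus and takes an N-ERL.
            n_linker = "RAL" if m_idx > 0 else "N-ERL"
            if m_idx < len(modules) - 1:
                c_linker = "RAL"      # fused to a downstream module
            elif p_idx < last_polypeptide:
                c_linker = "C-ERL"    # hands the chain to another polypeptide
            else:
                c_linker = None       # final module: no downstream transfer
            plan.append((name, n_linker, c_linker))
    return plan


if __name__ == "__main__":
    for row in assign_linkers([["M1", "M2"], ["M3"]]):
        print(row)
```

For productive transfer the C-ERL of a donor and the N-ERL of its acceptor would additionally need to be a matched pair, which this routine does not check.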

[0083] The various modules, with appropriate linkers, are then assembled into the
desired polyketide synthase. As stated above, the construction of the PKS can be based
on plugging in active portions of modules into an existing linker array. The assembly can
be performed by simply mixing the peptides containing the modules or may be generated
recombinantly from expression constructs in a host cell. The cell may provide the 
appropriate substrates for the PKS, or the substrates may need to be provided to the 
reaction mixture containing the polypeptides or to the cells in which they are generated. 
Depending on the choice of host, provision may need to be made for providing these 
substrates. 

[0084] In this way, the modules can be "mixed and matched" as desired to construct a 
polyketide product from the desired extender units and with the desired beta-carbonyl 
modification, choosing the linkers in accordance with the position of the module in a 
polypeptide, and the number of modules can be altered as desired.
[0085] A preferred starter unit for such an assembly of modules is a diketide thioester 
either formed in situ by including a module which contains a loading domain to 
incorporate a starter unit along with an extender unit to attain this resultant, or the 
diketide may be synthesized independently and used as the substrate for the PKS. The 
synthesized diketide may be supplied as the thioester, such as the N-acylcysteamine 
thioesters. Preparation methods for these thioesters are described in the above-referenced 
U.S. Serial No. 09/346,860 filed 2 July 1999 and the corresponding PCT application, as 
well as U.S. Serial No. (Atty. docket No. 30062-20032.00) filed 27 January 2000. 
[0086] Using the techniques of the invention, it is thus possible to manipulate entire 
modules and effect efficient cross-talk so as to assure production of the desired 
macrolide. Such techniques can be used, for example, to alter the structure of macrolide 
anti-infectives by, for example, replacing the module 2 of the erythromycin gene cluster
with module 2 of the tylosin gene cluster, or replacing the erythromycin module 6 (along
with its thioester sequence) with the corresponding module 6 from narbomycin.

[0087] In addition, 14-membered macrolides could be expanded to become
16-membered macrolides by fusing modules 2-3 of the tylosin, spiramycin or niddamycin
PKS between modules 1 and 3 of the erythromycin synthase or by adding any
arbitrarily chosen module from other Type I PKS clusters into the synthase for
production of erythromycin. Alternatively, modules 1-2 of erythromycin could be
deleted and replaced by modules 1-3 of tylosin, spiramycin or niddamycin.

[0088] In addition, new substituents can be introduced into, for example,
erythromycin or its precursors by replacing the second module of the erythromycin PKS
with module 5 from tylosin PKS where the substituted module has the enoyl reductase 
catalytic activity inactivated. This results in erythromycins substituted with an ethyl 
group at the 10-position. Alternatively, erythromycin module 5 could be replaced by the 
spiramycin module 6 to obtain 5-desmethyl-4-OH erythromycins. 

[0089] Improved forms of FK-506 are obtained by replacing rapamycin modules 2-10 
with FK-506 modules 2-6, or by replacing rapamycin modules 2-11 with FK-506
modules 2-7 or by replacing rapamycin modules 2-12 with FK-506 modules 2-8 or by
replacing rapamycin modules 11-14 with FK-506 modules 7-10. Any combination or
subset of the above could also be employed. Improved forms of FK-520 can be made in 
a similar manner. An alternative form of rapamycin is synthesized by substituting the 
FK-506/520 module 1 for rapamycin module 1. 

[0090] The foregoing are merely exemplary of the types of manipulations that could 
be employed. The polyketides, obtained by supplying the appropriate substrates either 
in vitro or in vivo, may then be further modified if desired by hydroxylation, 
glycosylation and the like to obtain desired products. Further, chemical synthetic 
manipulations may also be employed.

[0091] Some of the resulting compounds described above could be prepared by 
alternative techniques previously disclosed, for example, in PCT applications 
PCT/US99/22886 or PCT/US99/24483. However, the procedure described above, which 
manipulates entire modules, may result in better yield or more convenient synthesis.

[0092] In addition to housing six modules, the three polypeptides of DEBS each
possess short, nonconserved segments of amino acid residues located at the N- and
C-termini of adjacent polypeptides (shown with complementary symbols in Figure 4 and
described in Example 6 below). A previous study has discussed the importance of
keeping these interpolypeptide "linkers" intact during the engineering of chimeric PKSs
(Gokhale, R.S., et al. (1999) Science 284, 482-485). To dissect the precise role of these
linkers in mediating intermodular interactions, an in vitro system consisting of a donor 
module and an acceptor module was developed and kinetically characterized and is 
described in Example 6 below. Each of the components of this functional linker pair 
could then be replaced with counterparts from other naturally occurring linker pairs. The 
results of these experiments, shown in Example 6 and accompanying figures 4-9, support 
the understanding that the linker regions outside a module's conserved catalytic domains 
impact its interactions with other modules. In addition to their importance in functionally 
connecting modules, these short sequences also play an active role in the protein-protein 
recognition of modules, helping to maintain the selective transfer of intermediates. 
[0093] There are several strategies for rationally manipulating polyketide structure by
engineering DEBS. For example, it has been demonstrated that DEBS is amenable to the
introduction of unnatural side chains at the C12 and C13 positions via precursor-directed
feeding of diketides (Jacobsen, et al., Science 1997, 277, 367-369; Jacobsen, et al.,
Bioorg. Med. Chem. 1998, 6, 1171-1177; Hunziker, et al., Tetrahedron Lett. 1999, 40,
635-638), as well as via replacement of loading didomains from alternative synthases
(see Marsden, et al., Science 1998, 279, 199-202). In addition, protein engineering of
DEBS can generate truncated polyketides (Kao, et al., J. Am. Chem. Soc. 1994, 116, 11612;
Cortes, et al., Science 1995, 268, 1487-1489; Kao, C. M., et al., J. Am. Chem. Soc. 1995,
117, 9105-9106; Kao, et al., J. Am. Chem. Soc. 1996, 118, 9184), epimerized polyketides
(Bohm, et al., Chem. Biol. 1998, 5, 407-412; Kao, et al., J. Am. Chem. Soc. 1998, 120,
2478-2479; Holzbaur, et al., Chem. Biol. 1999, 6, 189-195; Bycroft, et al., Biochem.
2000, 267, 520-526), desmethyl polyketides (Oliynyk, et al., Chem. Biol. 1996, 3, 833-839;
Ruan, et al., J. Bacteriol. 1997, 179, 6416-6425; Liu, et al., J. Am. Chem. Soc. 1997,
119, 10553-10554; Lau, et al., Biochemistry 1999, 38, 1643-1651), polyketides
containing various degrees of modification of the β-keto groups (Donadio, et al., Science
1991, 252, 675-679; Donadio, et al., Proc. Natl. Acad. Sci. U.S.A. 1993, 90, 7119-7123;
Bedford, et al., 1996, 3, 827-831; McDaniel, et al., J. Am. Chem. Soc. 1997, 119,
4309-4310; Kao, C. M., et al., J. Am. Chem. Soc. 1997, 119, 11339-11340), and combinations
thereof. See McDaniel, et al., Proc. Natl. Acad. Sci. U.S.A. 1999, 96, 1846-1851.
However, one approach for generating diversity in polyketides that has been exploited
only to a limited extent (Gokhale, et al., Science 1999, 284, 482-485; Ranganathan, et al.,
Chem. Biol. 1999, 6, 731-741) is the fusion of intact modules (or groups thereof) from
different PKSs to generate chimeric assembly lines. While the application of such a
strategy takes advantage of the natural catalytic grouping of the modules to produce 
enzymes of improved catalytic effectiveness, two major issues must be addressed to 
rationally implement a modular rearrangement strategy for combinatorial biosynthesis. 
First, the molecular recognition features of individual modules need to be deciphered, so 
that their placement in hybrid PKSs can be restricted to catalytically productive contexts. 
Second, the mechanistic basis for transferring intermediates between adjacent modules 
must be understood, so that intermodular chain transfer can efficiently occur between 
heterologous modules. This report provides new insights into the relative importance of 
both of these issues and their interrelationships in the context of a multimodular PKS. 
[0094] The tolerance and specificity of individual modules of DEBS have been 
indirectly investigated using a variety of genetic, biochemical, and chemical 
approaches. Recently, it has been possible to express and reconstitute individual DEBS
modules as intact proteins. See Gokhale, et al., Science 1999, 284, 482-485. This
allowed us to directly assess the substrate specificities of four modules of DEBS
(modules 2, 3, 5, and 6) using a set of N-acetylcysteamine (NAC)-activated diketides as
potential substrates (2a-d, Figure 10A). See Wu, et al., J. Am. Chem. Soc. 2000, 122,
4847-4852. Surprisingly, not only did the substrate specificity profiles of these four individual
modules turn out to be quite similar, but these profiles also did not correlate well to the
structures of the natural substrates of individual modules. Separately, recent experiments
have suggested that short intermodular linker sequences play an important role in the
selective transfer of polyketide intermediates between modules. See Gokhale, et al.,
Science 1999, 284, 482-485; Tsuji, et al., Biochemistry 2001. Therefore, we considered it
appropriate to reexamine the steady-state kinetic parameters of individual DEBS 
modules, but this time, to pay closer attention to possible protein-protein interactions that 
could be involved in passing a substrate from an upstream module to its downstream 
neighbor. 

[0095] There are two modes by which a substrate can be passed from one module to
the next. If the two successive modules are on the same polypeptide (such as modules 1
and 2 of DEBS), there is an intrapolypeptide chain transfer. On the other hand, if the two
successive modules are on separate polypeptides (such as modules 2 and 3 of DEBS),
there is an interpolypeptide chain transfer. In either case, biosynthetic intermediates
undergo direct interthiol transfer between adjacent modules such that the intermediates
never go into bulk solution. We refer to this property as the "physical channeling" of
intermediates between modules.

[0096] Physical channeling (also commonly referred to as substrate channeling) is
defined as a mechanism in a sequence of reactions in which a reaction intermediate is
transferred from one active site to the downstream active site without equilibrating with
the bulk solution. See Kirsch, et al., Biochemistry 1999, 38, 8032-8037. Physical 
channeling of intermediates can provide kinetic benefits by increasing the effective 
concentration of the substrate, protecting labile intermediates from unproductive 
reactions, and precluding entrance of intermediates into competing enzymatic pathways. 
Furthermore, substrate channeling between two enzymes can help overcome product 
inhibition of the upstream enzyme by tunneling the intermediate out of the upstream 
binding pocket and into the downstream binding pocket more efficiently. 
[0097] While physical channeling is a necessary outcome of fundamental polyketide 
biosynthetic mechanisms (Donadio, et al., Science 1991, 252, 675-679; Cortes, et al., 
Nature 1990, 348, 176-178), the kinetic advantage, if any, of channeling intermediates 
between modules has not yet been resolved. "Kinetic channeling" is defined here as
physical channeling that results in a kinetic advantage, as measured by kcat, over a
diffusive loading mechanism in which the intermediate equilibrates in the bulk phase
after release from the upstream active site and before loading in the downstream active
site. To elucidate the issue of kinetic channeling in modular PKSs, two new assay systems
(one to probe intrapolypeptide transfers and one to probe interpolypeptide transfers) were
devised that would more accurately mimic the transfer of a substrate from the acyl carrier
protein (ACP) of one module to the ketosynthase (KS) of the next. These assays are
described in further detail in Example 7 below. In the first assay system, the loading
didomain and module 1 of DEBS generated in situ the natural diketide intermediate
((2S,3R)-2-methyl-3-hydroxy-pentanoyl-S-ACP1), which could then be transferred to
alternative downstream modules in a bimodular PKS context (Figure 10B). By
comparing the kinetic parameters of these hybrid bimodular systems to those for
elongation of the same diketide that has been supplied exogenously to the isolated
downstream module (Figure 10A), the kinetic benefit of channeling intermediates
between covalently linked modules could be evaluated. A second assay system was
developed using a chemoenzymatic method, through which alternative diketides were
covalently attached to the phosphopantetheine arms of an individually expressed donor
ACP domain (Figure 10C). Here, the entire diketide-S-ACP adduct (4a-d) is a formal
substrate for a recipient module, therefore allowing investigation of interpolypeptide
channeling. (The linker sequence at the C-terminal end of the ACP as previously
described, see Tsuji, et al., Biochemistry 2001, was included in this construct.) By
attaching different diketides to the same ACP, the steady-state kinetic parameters for
diketide elongation by individual modules (each with a TE domain fused to its C
terminus to facilitate turnover) could be measured. Both assay systems were used to
compare the properties of modules 2, 5, and 6 of DEBS, three modules that perform the
same chemistry with identical stereocontrol, albeit on very different substrates (Figure 4).
The results of these studies are described in Example 7 below.



[0098] The present invention is further described by the compounds and methods 
described in the following examples. The examples are provided solely to illustrate the 
invention by reference to specific embodiments. These exemplifications, while 

Preparation A
Construction of Single Module Based Systems 

Single Module Gene Constructs 

[0099] Single module constructs from the DEBS gene cluster were prepared for
modules 2, 3, 5 and 6 as follows. The TE domain is fused to the module to facilitate
termination. The (M3+TE) gene was prepared from the tri-modular construct pKAO318
(McDaniel, R., et al., Chem. Biol. (1997) 4:667) having an NheI site engineered at the
start of the DEBS-2 gene. Fusion of the TE gene at the end of ACP3 was described in
connection with the construction of pCK13 in Cortes, J., et al., Science (1995) 268:1487;
Kao, C.M., et al., J. Am. Chem. Soc. (1995) 117:9105 and Kao, C.M., et al., J. Am. Chem.
Soc. (1996) 118:9184, collectively cited below as the "Cortes-Kao documents." The
NheI-EcoRI fragment was cloned into pET21c (Novagen) to construct pRSG34. The
EcoRI site was used to delete the stop codon of the TE domain so that the protein could
be overproduced as a carboxy terminal (His)6 tagged fusion protein.

[00100] (M5+TE) was constructed by combining the engineered NdeI site from
pJRJ10 (Jacobsen, et al., Biochem. (1998) 37:4928) with the EcoRI site from pCK15
(Cortes-Kao documents). The NdeI-EcoRI fragment was cloned in pET21c to obtain the
expression plasmid pRSG46. Expression constructs for (M2+TE) and (M6+TE) were
prepared similarly using an engineered NheI site immediately upstream of the
corresponding KS (at position 7570, 5'-GCTAGCGAGCCGATC-3' (SEQ ID NO:1) and
at position 28710, 5'-GCTAGCGACCCGATC-3' (SEQ ID NO:2)).
[00101] These constructs were expressed in E. coli BL21 (DE3) along with an
expression system for sfp phosphopantetheinyl transferase from B. subtilis. The
co-expression is described by Lambalot, R.H., et al., Chem. Biol. (1996) 3:923. For the
construction of the sfp gene, the NdeI-HindIII fragment derived from the pUC8-sfp
(Nakano, et al., Mol. Gen. Genet. (1992) 232:313) was cloned into pET28 which has a
kanamycin resistance gene to give resultant plasmid pRSG56. The resulting proteins
were then isolated for use in the reaction mixtures described in the Examples below.
[00102] In more detail, the expression was induced with 1 mM
isopropyl-β-D-thiogalactopyranoside, and the cells were harvested by centrifugation after
10 hours and resuspended in disruption buffer (200 mM sodium phosphate pH 7.2,
200 mM sodium chloride, 2.5 mM dithiothreitol, 2.5 mM sodium
ethylenediaminetetraacetate (EDTA), 1.5 mM benzamidine, 2 mg/L pepstatin and
leupeptin and 30% v/v glycerol). The cells were lysed by passing through a French press,
and the lysate was collected after centrifugation. Nucleic acids were precipitated with
polyethylenimine (0.15%) and removed via centrifugation. The supernatant was made
50% (w/v) saturated with ammonium sulfate and precipitated overnight. After
centrifugation, the pellet containing protein was redissolved in buffer A (100 mM sodium
phosphate pH 7.2, 2.5 mM DTT, 2 mM EDTA and 20% glycerol (v/v)) and stored at
-80°C. For chromatography, the buffer was exchanged to buffer A + 1 M ammonium
sulfate using a gel filtration PD-10 (Pharmacia) column. The resulting sample was loaded
on a Butyl Sepharose (Pharmacia) column. Fractions containing DEBS proteins were
pooled and applied on an anion exchange column (Resource Q; 6 mL, Pharmacia).
Purified protein fractions were pooled and concentrated using Amicon Centriprep 30.
Typical purified
protein yields were ~ 3-4 mg/liter of culture. Greater than 90% of proteins were 
phosphopantetheinylated in vivo as a result of the overexpression of sfp 
phosphopantetheinyl transferase. Although the proteins were expressed as (His)6-tagged 
proteins, they did not bind to a Ni-column under experimental conditions. It is unclear 
whether this inability to bind to a Ni-agarose column is due to steric effects or if the 
(His)6 peptide was lost during purification. 

Example 1 

Requirements for Cell-Free Synthesis of Triketides by Individual Modules -
Identification of Linker Regions

[00103] A cell-free system, tested for the ability to convert the cysteamine thioester of
2S,3R-2-methyl-3-hydroxypentanoic acid (compound 2 in Figure 2), consisted of
0.5-10 mM concentration of the thioester, 2.5 mM 14C-labeled methylmalonyl CoA and
100 pmoles of purified protein prepared as described in Preparation A in a 100 µl
reaction. In some cases, 1 mM of NADPH was added; this was done in assay mixtures
containing (M2+TE), (M5+TE) and (M6+TE) proteins. The protein was, in each case, a
single module of the DEBS PKS fused to the thioesterase (TE) termination region of
module 6.
[00104] The reaction mixtures were quenched and extracted by ethyl acetate and
separated by thin-layer chromatography (TLC) to discern the formation of the triketide
ketolactone 3 and triketide lactone 4 (both shown in Figure 2) after 30 minutes. Results
are shown as the first four entries in Table 1.

Table 1

Formation of Triketides by Single Module Constructs

      Construct                                   Plasmid    Triketide Formed
  1.  M3+TE                                       pRSG34     Yes
  2.  M5+TE                                       pRSG46     Yes
  3.  M2+TE                                                  No
  4.  M6+TE                                                  No
  5.  ERL-M2+TE                                   pRSG64     Yes
  6.  ERL-M6+TE                                   pRSG54     Yes
  7.  M1-RAL-M6+TE                                pST96      Yes
  8.  M1-RAL-M3+TE                                pST97      Yes
  9.  ery M1-RAL-rif M5+TE                        pST110     Yes
 10.  (ery M1-RAL-rif M5+ERL)+DEBS-2+DEBS-3       pST113     Yes (6-dEB)

[00105] As seen in Table 1, although the expected triketides were formed from
(M3+TE) and (M5+TE) (modules which reside at the upstream portion of their respective
polypeptides), no triketides were formed from (M2+TE) or (M6+TE) (modules which
reside at the C-terminal portions of their polypeptides). These latter results were
unexpected since the diketide can be incorporated by module 2 when it is supplied as a
part of the complete polypeptide DEBS-1. It was verified that the ACP domain was
pantetheinylated in modules 2 and 6, and that for (M2+TE), the KS domain could not be
acylated with radiolabeled diketide.

Example 2

Modification of Single Modules with Linker Sequences 

[00106] The constructs for (M2+TE) and (M6+TE) were modified by deleting the
sequences encoding the amino acids upstream of the KS catalytic domain and substituting
the first 39 amino acids from (M5+TE) containing the N-terminal portion of this
interpolypeptide linker (N-ERL). The relevant constructs were prepared by replacing the
BsaBI-EcoRI fragment in pRSG46 by the corresponding fragment from pCK4 to obtain
(N-ERL-M2+TE), in plasmid pRSG64, or from pJRJ10 to obtain (N-ERL-M6+TE) in
plasmid pRSG54. These constructs yield modules which contain the upstream 39 amino
acids from module 5. The constructs were expressed in E. coli and proteins obtained as
described in Preparation A. These proteins were able to produce the triketide product
from diketide in the cell free system of Example 1, as shown in entries 5 and 6 in Table 1.
[00107] The various constructs which are successful in converting diketide to triketide
were then evaluated for the kinetic constants kcat and Km. These results are shown in
Table 2. As shown in Table 2, the results are quite similar for all constructs except that
the results from module 3 show a several-fold decrease in kcat as compared to the other
modules. This is evidently due to the absence of beta-carbonyl modifying enzymes in
module 3 as verified by the fact that removal of NADPH (which is required for the
activity of such modules) from the reaction mixture of (N-ERL-M6+TE) also results in a
lowering of the kcat.

Table 2

Proteins                     kcat × 100 (min⁻¹)     Km (mM)
(N-ERL-M2+TE)                8 ± 0.6                4.6 ± 0.4
(M3+TE)                      1.5 ± 0.3              4.4 ± 0.4
(M5+TE)                      7.5 ± 0.7              4.7 ± 0.4
(N-ERL-M6+TE)                9.5 ± 0.6              4.3 ± 0.4
(N-ERL-M6+TE) (-NADPH)       4.5 ± 0.7              4.1 ± 0.4

[00108] It is apparent from these results that the presence of the N-terminal upstream 
sequence associated with modules located at the N-terminal portion of the polypeptide is 
essential for permitting a module in this position to incorporate the growing polyketide 
chain. 

Example 3

Construction of (M1-RAL-M3+TE) and (M1-RAL-M6+TE)

[00109] The BsaBI-EcoRI fragments containing modules 3 and 6 respectively were
cloned behind the M1 module which contains the intrapolypeptide linker (RAL) that
natively resides between M1 and M2. The resulting M1-RAL-M3+TE and
M1-RAL-M6+TE genes were then excised as PacI-EcoRI fragments and inserted into
pCK12 resulting in plasmids pST97 and pST96 respectively. The corresponding proteins
were produced by transformation into S. coelicolor CH999. The resulting strains of
S. coelicolor were able to incorporate the diketide thioester into the triketide as shown by
entries 7 and 8 in Table 1. (The triketide produced is the ketolactone 3 in Figure 2.)

Example 4

Additional Intrapolypeptide Mediated Transfer

[00110] A construct wherein the first module of the DEBS PKS cluster (ery), which
contains the intrapolypeptide linker of the corresponding M1-M2 polypeptide from the
erythromycin PKS, is fused to the fifth module of the rifamycin PKS (rif) was
constructed by replacing the natural sequence at 28024 of rif ACP5 (5'-CGCGAC-3')
with the SpeI recognition sequence 5'-ACTAGT-3'. The BsaBI-SpeI fragment
containing rif M5 was excised and used to replace the corresponding ery M1-RAL-
fragment in pCK12 to obtain plasmid pST110. This plasmid, containing
ery M1-RAL-rif M5+TE, was transformed into S. coelicolor CH999 and the resulting
strain was able to incorporate the diketide into the triketide lactone as shown by entry 9
in Table 1. The amount is comparable to that produced in this strain transformed with
DEBS-1+TE.

Example 5 

Construction of Modules for Intermolecular Transfer 

[00111] The PacI-SpeI fragment of pST110 was inserted into a derivative of pCK7
(Kao, C.M., et al., Science (1994) 265:509) which had an SpeI site engineered at the
beginning of the scaffolding sequence at the carboxy terminus of the polypeptide
downstream of ACP2. The resulting pST113 construct still contains ery M1 linked to
rif M5 via the natural intrapolypeptide linker between ery modules 1 and 2, and also
now contains rif M5 covalently linked to the downstream C-terminal portion of the ERL
derived from ery M2. Thus, the complete ERL between the polypeptide generated by
pST113 and the protein generated by a construct which generates DEBS-2 would
correspond to the native ERL in the ery PKS, i.e., rif M5 would be associated with ery
M3 via the natural interpolypeptide linker between ery modules 2 and 3.
Co-transformation into S. coelicolor of pST113 along with constructs that produce DEBS-2
and DEBS-3 results in the production of 6-dEB, as shown by entry 10 of Table 1.

Example 6

Construction of Modules for Interpolypeptide Transfer With Matched and Mismatched
Linker Pairs (M2 and M3+TE, M2 and (5)M3+TE, M2(4) and M3+TE, and M2(4) and
(5)M3+TE)

[00112] Reagents and Chemicals. DL-[2-methyl-14C]Methylmalonyl-CoA (56.4
mCi/mmol) was obtained from ARC, Inc.

[00113] The N-terminal linker of M3 was synthesized by New England Peptide
(Fitchburg, MA). The peptide sequence was as follows, M3 N-term:
H2N-MTDSEKVAEYLRRATLDLRAARQRIRELESD-amide.
[00114] Construction of Plasmids. Plasmid pBP19 contains module 2 of DEBS
(M2) and is a derivative of pRSG64 (Gokhale, R. S., et al., (1999) Science 284, 482-485),
in which the thioesterase domain was replaced with a SpeI-EcoRI fragment containing the
natural C-terminal linker for module 2. Plasmid pST179 encodes a
derivative of M2 containing the C-terminal linker of DEBS module 4 (M4). The
C-terminal linker of M4 was obtained as a SpeI-EcoRI fragment by PCR using the primers
5'-ACT AGT AGG CTG TTC GCG GCC TCA C-3' and 5'-G GGA ATT CAG GTC
CTC TCC CCC GC-3' (bold sequences complement DEBS sequence). The PCR
amplicon was inserted after M2 using the engineered sites, yielding pST179.
Plasmid pRSG34 encodes module 3 of DEBS (M3) with its own N-terminal linker and
with the thioesterase fused to the C-terminus. Its construction has been described
previously (id.). Plasmid pST132 encodes a derivative of M3 + TE, where the natural
N-terminal linker of pRSG34 has been replaced with the N-terminal linker of module 5 of
DEBS (M5). This substitution required the replacement of the NdeI-BsaBI fragment of
pRSG34 with the corresponding fragment from pJRJ10 (Jacobsen, J. R., et al., (1998)
Biochemistry 37, 4928-4934). All constructs were cloned into pET-21c (Novagen)
vectors for expression in Escherichia coli.
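
The cloning steps above rely on engineered restriction sites carried by the quoted oligonucleotides. The following is a minimal Python sketch, not part of the described method, that checks which recognition sequences occur in each oligonucleotide. The recognition sequences for NheI (GCTAGC) and EcoRI (GAATTC) are the standard ones, and the SpeI sequence (ACTAGT) is the one quoted in Example 4. The labels "forward" and "reverse" and the function name `find_sites` are assumptions introduced only for illustration.

```python
# Recognition sequences for the enzymes named in the text.
SITES = {"NheI": "GCTAGC", "SpeI": "ACTAGT", "EcoRI": "GAATTC"}

# Oligonucleotides quoted in the text; the primer labels are hypothetical.
OLIGOS = {
    "SEQ ID NO:1": "GCTAGCGAGCCGATC",
    "SEQ ID NO:2": "GCTAGCGACCCGATC",
    "M4 linker primer (forward)": "ACT AGT AGG CTG TTC GCG GCC TCA C",
    "M4 linker primer (reverse)": "G GGA ATT CAG GTC CTC TCC CCC GC",
}


def find_sites(seq):
    """Return {enzyme: [0-based start positions]} for each site present in seq."""
    seq = seq.upper().replace(" ", "")
    hits = {}
    for enzyme, site in SITES.items():
        positions = [i for i in range(len(seq) - len(site) + 1)
                     if seq.startswith(site, i)]
        if positions:
            hits[enzyme] = positions
    return hits


if __name__ == "__main__":
    for name, seq in OLIGOS.items():
        print(name, find_sites(seq))
```

Only the strand as written is scanned; because all three recognition sequences are palindromic, a site on the complementary strand appears at the same position.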

[00115] Strain and Culture Conditions. Expression of the desired proteins was
achieved by transforming the above plasmids into an engineered strain of E. coli
BL21(DE3) containing the sfp phosphopantetheinyl transferase gene from Bacillus
subtilis (Lambalot, R. H., et al., (1996) Chem. Biol. 3, 923-936). The sfp gene product
was required to posttranslationally modify the acyl carrier protein (ACP) domains by
phosphopantetheinylating the apo-ACP (Gokhale, R. S., et al., (1999) Science 284,
482-485). Cells containing the expression plasmids were selected with carbenicillin and used
to inoculate a 10-20 mL LB medium starter culture grown at 37°C. After 6 h, the cells
were pelleted and used to inoculate two 2 L flasks containing 1 L of LB medium each.
The flasks were shaken at 250 rpm at 37°C until the culture optical density at 600 nm
(OD600) was 0.6. The flasks were placed in a water bath to cool the cells to 22°C (ca. 10
min) and then induced with 0.5 mM isopropyl β-D-thiogalactopyranoside at an OD600 of
0.8. Flasks were then stirred at 22-24°C for 10-14 h.

[00116] Purification of Proteins. After induction, the cells were harvested via
centrifugation and washed in 50 mM Tris (pH 8) and 1 mM ethylenediaminetetraacetic
acid (EDTA) before being resuspended in disruption buffer [200 mM sodium chloride,
200 mM sodium phosphate, 2.5 mM dithiothreitol (DTT), 2.5 mM EDTA, 1.5 mM
benzamidine, pepstatin and leupeptin (2 mg/L), and 30% (w/v) glycerol]. The cell
suspension was lysed at 1250 psi using a French press and then centrifuged.
Polyethylenimine was added to the supernatant to 0.15% to precipitate nucleic acids.
Following centrifugation (20 min at 33300g) to remove the nucleic acids, ammonium
sulfate was added to the supernatant until a 50% (w/v) saturation was achieved and
allowed to precipitate for 2-3 h. The pellet following a 45 min centrifugation (33300g)
was resuspended in buffer A [100 mM sodium phosphate (pH 7.2), 2 mM DTT, 1 mM
EDTA, and 20% (v/v) glycerol]. The resulting suspension was applied in 2.5 mL 
aliquots to a 9.1 mL gel filtration column (PD-10, Pharmacia) equilibrated with buffer B 
(buffer A+1M ammonium sulfate) and eluted in 3.5 mL of buffer B. This eluant was 
applied to a 30 mL hydrophobic-interaction column (Butyl-Sepharose 4 FastFlow, 
Pharmacia) at 1 mL/min. Elution was performed at 1 mL/min with stepwise changes in 
buffer starting from 100% buffer B, to 40%, 20%, and 0%. Steps were made when the
absorbance at 280 nm approached baseline. Fractions were 10 mL, and those containing 
the protein of interest (typically eluted with 0% buffer B) were pooled and applied to an 
anion-exchange column (Resource Q, 6 mL, Pharmacia) at 1 mL/min. A gradient of 0- 
0.15 M NaCl in buffer A was run at 1 mL/min for 3 column volumes, followed by a 
gentle gradient of 0.15-0.30 M NaCl at 1 mL/min for 10 column volumes. Fractions of 2 
mL were collected, and those containing concentrated protein (typically 0.22-0.25 M 
NaCl) were pooled and further concentrated on Centriprep 50 membranes (50 kDa 
molecular mass cutoff; Amicon) to a concentration of 0.1-4 mg/mL. Protein 
concentrations were measured via the modified Lowry assay (Sigma) and densitometric 
analysis of SDS-PAGE gels stained with Coomassie Blue. On the basis of the
densitometry data, all proteins were determined to be >90% pure. 

[00117] In Vitro Polyketide Production. Assays of individual modules contained
1.0 µM protein, 1-10 mM N-acetylcysteamine thioester of the "natural" (2S,3R)-2-methyl-3-hydroxypentanoic
acid diketide (NDK) (1), 4 mM NADPH, 440 mM sodium
phosphate, 1 mM EDTA, 2.5 mM dithiothreitol (DTT), and 20% w/v glycerol, pH 7.2, in
80 µL. Reactions with M2 or M2(4) included 0.3 mM 14C-methylmalonyl-CoA (specific
activity adjusted to 10.4 mCi/mmol), and those with M3 + TE or (5)M3 + TE included
0.5 mM 14C-methylmalonyl-CoA (specific activity reduced to 1.1 mCi/mmol). [M2(4)
refers to a derivative of M2 in which the C-terminal linker has been replaced with its
counterpart from module 4, whereas (5)M3 refers to a derivative of M3 in which the
N-terminal linker has been replaced with its counterpart from module 5 (see Figure 4).]
Both concentrations of methylmalonyl-CoA were saturating, but different concentrations
were needed due to the disparate rates of turnover between the modules with and without
the thioesterase. The reaction mixtures were preincubated at 30°C and started with the
addition of the methylmalonyl-CoA. The reactions were incubated at 30°C, and at three
to four time points, 20 µL aliquots were removed, quenched with ethyl acetate, and
extracted twice (450 µL total) to isolate the polyketide products. The ethyl acetate
layers were pooled, dried in vacuo, and applied to an analytical thin-layer
chromatography (TLC) plate (Si250F; Baker). The TLC plate was developed using 50%
ethyl acetate in hexanes as the mobile phase (triketide ketolactone 4, Rf = 0.5; triketide
lactone 3, Rf = 0.4) and then visualized by electronic autoradiography (InstantImager,
Packard Instruments). Product formation was quantified by comparison with standards of
the labeled methylmalonyl-CoA. The identity of products 2, 3, and 4 has been
unambiguously established in earlier in vitro enzymatic studies (Pieper, R., et al., (1995)
J. Am. Chem. Soc. 117, 11373-11374; Pieper, R., et al., (1997) Biochemistry 36,
1846-1851) and was reconfirmed by chromatographic comparison with authentic reference
samples. 
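
The assays above adjust the specific activity of the 56.4 mCi/mmol methylmalonyl-CoA stock to lower working values. The following is a minimal Python sketch of the arithmetic behind such an isotopic dilution, assuming the diluent is unlabeled methylmalonyl-CoA with zero activity. The function names are hypothetical. The conversion from disintegrations per minute to picomoles is included only as an illustration and is not the quantification method described above, which relied on comparison with labeled standards.

```python
DPM_PER_MCI = 2.22e9  # 1 Ci = 3.7e10 Bq = 2.22e12 dpm, so 1 mCi = 2.22e9 dpm


def labeled_fraction(stock_sa, target_sa):
    """Mole fraction of labeled stock in the final mixture.

    Both specific activities are in mCi/mmol; the remainder of the mixture
    is assumed to be unlabeled material.
    """
    if not 0 < target_sa <= stock_sa:
        raise ValueError("target must be positive and no greater than the stock")
    return target_sa / stock_sa


def dpm_to_pmol(dpm, specific_activity):
    """Convert measured dpm to pmol of labeled compound (specific activity in mCi/mmol)."""
    mmol = dpm / (specific_activity * DPM_PER_MCI)
    return mmol * 1e9  # 1 mmol = 1e9 pmol


if __name__ == "__main__":
    for target in (10.4, 6.2, 3.4, 1.1):
        print(f"{target} mCi/mmol: {labeled_fraction(56.4, target):.3f} of moles from stock")
```

Under these assumptions, lowering the specific activity trades sensitivity for the ability to supply saturating substrate in the faster reactions.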
[00118] Assays of M2 and M3 + TE contained 1.0 µM M2 and 0.4-4 µM M3 + TE, 7
mM NDK, 0.5 mM 14C-methylmalonyl-CoA (specific activity reduced to 3.4 mCi/mmol),
4 mM NADPH, 440 mM sodium phosphate, 1 mM EDTA, 2.5 mM DTT, and 20% w/v
glycerol, pH 7.2, in 70 µL. Assays of M2(4) and (5)M3 + TE were identical, except they
contained 0.5 µM M2(4), 0.5-5 µM (5)M3 + TE, and 0.4 mM 14C-methylmalonyl-CoA
(specific activity reduced to 6.2 mCi/mmol). The concentration of M2(4) was limiting in
order to facilitate its saturation with (5)M3 + TE. The reactions were prewarmed at 30°C
and initiated by the addition of the methylmalonyl-CoA. As described above, 20 µL
aliquots were removed at various time points and processed. Extracts loaded onto TLC 
plates were separated using either 80% ethyl acetate in hexanes or 5% methanol in 
dichloromethane, both of which allowed identification of the tetraketide lactone 2 and the 
triketide lactones 3 and 4. 

[00119] Inhibition of Tetraketide Production. The ability of the synthetic peptide to 
inhibit the transfer reaction, and thus the production of tetraketide, was tested under the 
same reaction conditions described for the two module coincubations. The 
concentrations of M2 and M3 + TE were both 1.0 µM, and the concentrations of M2(4)
and (5)M3 + TE were 0.5 and 1.0 µM, respectively. The only difference was the addition
of the peptide at concentrations ranging from 1 to 100 µM to the assay containing M2 and
M3 + TE [or alternatively M2(4) and (5)M3 + TE]. For greater accuracy, each inhibition 
assay was performed side by side with a control lacking inhibitor. The effect of the 
inhibitor was thus determined by dividing the inhibited rate by the control rate. 
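
The normalization described above, in which each inhibited rate is divided by its side-by-side control rate, can be sketched in Python as follows. The function names and the example rates are hypothetical and are not data from this study.

```python
def relative_rate(inhibited_rate, control_rate):
    """Inhibited rate expressed as a fraction of the paired control rate."""
    if control_rate <= 0:
        raise ValueError("control rate must be positive")
    return inhibited_rate / control_rate


def percent_inhibition(inhibited_rate, control_rate):
    """Percent loss of activity relative to the paired control."""
    return 100.0 * (1.0 - relative_rate(inhibited_rate, control_rate))


if __name__ == "__main__":
    # Hypothetical paired rates (same units) at increasing peptide concentration.
    pairs = [(0.25, 0.27), (0.18, 0.26), (0.09, 0.27)]
    for inhibited, control in pairs:
        print(f"{relative_rate(inhibited, control):.2f}",
              f"{percent_inhibition(inhibited, control):.0f}%")
```

Pairing each inhibited reaction with its own control cancels day-to-day variation in enzyme activity, which is the stated reason for running the assays side by side.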

[00120] Kinetic Analysis. For individual modules, the steady-state turnover number 
was determined from the time course of triketide formation, normalized to the 
concentration of protein. The dependence of the rate on substrate concentration was 
measured by varying the concentration of NDK while maintaining saturating levels of 
NADPH and methylmalonyl-CoA. From these data, the kcat and KM were calculated by
fitting the normalized v versus [S] plots to the Michaelis-Menten equation. 
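
As an illustration of extracting kcat and KM from normalized v versus [S] data, the following minimal Python sketch uses a Hanes-Woolf linearization ([S]/v plotted against [S]) solved by ordinary least squares. This linearized fit is substituted here only to keep the example dependency-free; the text does not state which fitting procedure was used, and a nonlinear fit is generally preferable for noisy data. The function names and the synthetic data are assumptions.

```python
def michaelis_menten(s, kcat, km):
    """Normalized rate v/[E] at substrate concentration s."""
    return kcat * s / (km + s)


def fit_hanes_woolf(s_values, v_values):
    """Estimate (kcat, km) from the line s/v = s/kcat + km/kcat."""
    if len(s_values) != len(v_values) or len(s_values) < 2:
        raise ValueError("need at least two paired observations")
    xs = list(s_values)
    ys = [s / v for s, v in zip(s_values, v_values)]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx                    # equals 1 / kcat
    intercept = mean_y - slope * mean_x  # equals km / kcat
    return 1.0 / slope, intercept / slope


if __name__ == "__main__":
    # Synthetic, noise-free data generated from illustrative parameters.
    s = [1.0, 2.0, 4.0, 6.0, 8.0, 10.0]
    v = [michaelis_menten(x, kcat=0.71, km=2.8) for x in s]
    print(fit_hanes_woolf(s, v))
```

The returned KM carries the same concentration units as the supplied [S] values, and kcat carries the units of the normalized rate.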
[00121] For tetraketide formation, the rate of production of tetraketide was recorded 
for varying concentrations of M3 + TE [or (5)M3 + TE] at a fixed concentration of M2 
[or M2(4)] and saturating concentrations of substrates. By fitting the rate dependence of
tetraketide to a saturation curve, the maximal velocity (vmax) of tetraketide production was
determined, and was assumed to represent the case where every M2 homodimer was
productively associated with an M3 + TE homodimer. Thus, the affinity of this protein-protein
interaction could be calculated from the rate of tetraketide formation, as
represented in Equation 1

\[ \mathrm{M2} + \mathrm{M3} \;\overset{K_D}{\rightleftharpoons}\; \mathrm{M2\text{-}M3} \;\xrightarrow{k_{cat}}\; \text{tetraketide} \tag{1} \]

([M2], [M3], and [M2-M3] refer to the concentrations of M2, unbound M3 + TE, and the
M2/M3 + TE complex, respectively). Since [M2-M3] is related to the K_D of M2 and M3
+ TE as shown in Equation 2,

\[ K_D = \frac{[\mathrm{M2}][\mathrm{M3}]}{[\mathrm{M2\text{-}M3}]} \tag{2} \]

which can be rearranged to yield Equation 3,

\[ [\mathrm{M2\text{-}M3}] = \frac{[\mathrm{M2}]_0\,[\mathrm{M3}]}{K_D + [\mathrm{M3}]} \tag{3} \]

where [M2]_0 = total concentration of M2, the velocity of tetraketide production can be
defined relative to the K_D:

\[ v = \frac{v_{max}\,[\mathrm{M3}]}{K_D + [\mathrm{M3}]} \tag{4} \]

where v = kcat[M2-M3] and vmax = kcat[M2]_0. Thus, fitting of the v versus [M3] plot
(which is equivalent to the bound M3 + TE versus free M3 + TE plot used for Scatchard
analysis) to Equation 4 allowed determination of the K_D for M2 and M3 + TE association.
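
The binding model above can be sketched numerically in Python. The functions `complex_conc` and `velocity` implement Equations 3 and 4 directly. The function `free_m3` is an added assumption not stated in the text: it solves the mass-balance quadratic to obtain unbound M3 + TE from total concentrations, which matters when the two proteins are present at comparable concentrations. All names are hypothetical, and the parameter values are taken from the M2 and M3 + TE results reported in this Example purely for illustration.

```python
import math


def complex_conc(m2_total, m3_free, kd):
    """Equation 3: concentration of the M2-M3 complex."""
    return m2_total * m3_free / (kd + m3_free)


def velocity(m2_total, m3_free, kd, kcat):
    """Equation 4: v = vmax*[M3]/(KD + [M3]) with vmax = kcat*[M2]0."""
    return kcat * complex_conc(m2_total, m3_free, kd)


def free_m3(m2_total, m3_total, kd):
    """Unbound M3 from totals, via the physically meaningful quadratic root.

    The complex C satisfies C**2 - (M2t + M3t + KD)*C + M2t*M3t = 0.
    """
    b = m2_total + m3_total + kd
    bound = (b - math.sqrt(b * b - 4.0 * m2_total * m3_total)) / 2.0
    return m3_total - bound


if __name__ == "__main__":
    kd, kcat, m2_total = 1.1, 0.27, 1.0  # µM, min^-1, µM (illustrative)
    for m3_total in (0.4, 1.0, 2.0, 4.0):
        m3 = free_m3(m2_total, m3_total, kd)
        print(m3_total, round(m3, 3), round(velocity(m2_total, m3, kd, kcat), 4))
```

When unbound [M3] equals K_D, the velocity is half of vmax, which is the usual way to read K_D off the saturation curve.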

[00122] CD Spectroscopy. The CD spectrum of the M3 N-terminal peptide was
recorded in a 1-mm path-length cell at a sample concentration of 100 µM in
phosphate-buffered saline (PBS; 0.15 M KCl, 25 mM phosphate, pH 6.9). Measurements were
made using an Aviv 62DS spectropolarimeter. Concentration was determined by tyrosine 
absorbance at 275 nm in 8 M guanidine hydrochloride. 
[00123] Kinetic Analysis of Individual Modules. To directly measure the effect of
linker replacement, individual modules with substituted linkers were kinetically
characterized using the natural diketide (NDK) as substrate. The only difference between
the M3 + TE and (5)M3 + TE proteins was that the former contained the N-terminal
linker of M3, whereas the latter contained the N-terminal linker of M5. As shown in
Figure 5, the modules displayed very similar kinetics. The calculated kcat values were
0.71 ± 0.1 and 0.68 ± 0.1 min⁻¹ and the KM values were 2.8 ± 1.5 and 2.5 ± 1.0 µM for M3
+ TE and (5)M3 + TE, respectively. Similar experiments were performed to compare M2
and M2(4), where the C-terminal linker of M2 had been swapped. Since the rate of
turnover of NDK to triketide lactone 3 by module 2 lacking the fused thioesterase domain
was low, however, only an approximate kcat could be estimated [0.02 and 0.03 min⁻¹ for
M2 and M2(4), respectively]. Thus, replacement of either the N-terminal or the
C-terminal linker of a module does not appear to affect its intrinsic catalytic properties.

[00124] Kinetic Analysis of M2-M3 Coincubations. Upon coincubation of M2 and
M3 + TE in the presence of NDK, methylmalonyl-CoA, and NADPH, tetraketide lactone
2 was formed (Figure 6) with a maximum rate constant of 0.27 ± 0.01 min⁻¹. The kcat of
M2 with M3 + TE was determined from the saturation curve obtained using a fixed
concentration of M2 and a variable concentration of M3 + TE and correlated well with an
earlier measurement of 0.23 min⁻¹ for formation of the same tetraketide by DEBS1 and
M3 + TE (Pieper, R., et al., (1997) Biochemistry 36, 1846-1851). Thus, the M2 protein
described here appears to be a viable alternative to DEBS1 as a donor of the triketide
intermediate to M3 + TE. From the saturation curve, a KD of 1.1 ± 0.1 µM for the M2-M3
+ TE interaction could be estimated (Figure 7A; see also Materials and Methods section).
Again, this value compares well with the previously reported value of 2.6 µM for DEBS1
and M3 + TE (Gokhale, R. S., et al., (1999) Chem. Biol. 6, 117-125).

[00125] Analogous to the above study, coincubation of M2(4) with (5)M3 + TE 
allowed examination of the effects of the transplanted DEBS2-DEBS3 linker pair (Figure 
4) on chain transfer between modules 2 and 3. The efficiency of chain transfer in the
presence of this alternative linker pair (Figure 6D) was established by measuring the
same parameters associated with synthesis of the tetraketide lactone 2 [kcat of 0.74 ± 0.06
min⁻¹ and a KD of 2.1 ± 0.4 µM (Figure 7B)].

[00126] Effects of Mismatched Linker Pairs. In contrast to the above studies with
M2 and M3 + TE (Figure 6A) or M2(4) and (5)M3 + TE (Figure 6D), both of the
noncomplementary coincubations [M2 with (5)M3 + TE (Figure 6B) and M2(4) with M3
+ TE (Figure 6C)] produced the tetraketide lactone 2 at severely compromised rates. The
highest rate constants that could be measured with either mismatched system were in the
range of 0.02-0.03 min⁻¹. Given these low rate constants, the dependence of rate on
protein concentration could not be measured. As might be expected, the rate of 
production of ketolactone 4 increased markedly in both cases relative to the matched 
incubations above (data not shown), since module 3 remained active in these mismatched 
coincubations. 

[00127] Inhibition of Tetraketide Production by a Synthetic Peptide. Sequence 
analysis using the CoilScan program (Lupas, A., et al., (1991) Science 252, 1162-1164;
Lupas, A., (1996) Methods Enzymol. 266, 513-52) revealed that the N- and C-terminal
interpolypeptide linkers of DEBS contained 15-20 residue segments with strong
propensity to assume a coiled-coil structure. Since the N-terminal linker of module 3 is
relatively short (31 residues), a peptide corresponding to this sequence was synthesized
(see Materials and Methods). As shown in Figure 8, this linker could inhibit the
formation of 2 in the presence of M2 and M3 + TE in a concentration-dependent manner.
To test the specificity of this peptide inhibitor, a similar assay was also conducted in the
presence of M2(4) and (5)M3 + TE. No inhibitory effect was observed at peptide
concentrations up to 100 µM (Figure 8). Furthermore, the peptide also showed no
inhibitory effect upon individually assayed modules, including M3 + TE. Thus, isolated 
peptide linkers appear to be selectively capable of competing for their cognate module- 
bound partners without affecting their individual catalytic activities. 

[00128] The N-terminal linker peptide of M3 was analyzed via circular dichroism 
(CD) to assess its α-helical character. As shown in Figure 9, the spectrum shows the 208 
and 222 nm absorbances characteristic of α-helices. The magnitude of these peaks 
allowed us to estimate that the peptide was approximately 50% helical (Chen, Y. H., et 
al., (1974) Biochemistry 13, 3350-3359). The CD of the peptide appeared invariant with 
peptide concentration, salt concentration, and phosphate concentration. 
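
A helicity estimate of this kind can be illustrated with a short calculation. The sketch below assumes the length-corrected relation commonly attributed to Chen et al. (1974), in which a fully helical peptide of n residues has a mean residue ellipticity at 222 nm of -39,500 (1 - 2.57/n) deg cm² dmol⁻¹. The text does not state which form of the analysis was applied, so the constants, the function names, and the example ellipticity are assumptions made for illustration only; the 31-residue length is taken from the description of the module 3 N-terminal linker.

```python
def max_helix_ellipticity(n_residues, theta_inf=-39500.0, k=2.57):
    """Mean residue ellipticity at 222 nm (deg cm^2 dmol^-1) expected for a
    fully helical peptide of n residues, using an assumed Chen-type
    length correction theta_inf * (1 - k / n)."""
    if n_residues <= k:
        raise ValueError("peptide too short for the length correction")
    return theta_inf * (1.0 - k / n_residues)


def fraction_helix(theta_222, n_residues):
    """Fractional helicity as observed / fully-helical ellipticity."""
    return theta_222 / max_helix_ellipticity(n_residues)


if __name__ == "__main__":
    n = 31                 # linker length stated in the text
    theta_obs = -18000.0   # hypothetical reading, not a measured value
    print(f"fully helical reference: {max_helix_ellipticity(n):.0f}")
    print(f"estimated helicity: {fraction_helix(theta_obs, n):.2f}")
```

Under these assumptions, a 31-residue peptide that is about half helical would show roughly half of the fully helical reference ellipticity at 222 nm.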

[00129] Earlier studies suggested the role of structurally intact intermodular linkers in 
facilitating chain transfer between noncovalently associated modules of PKSs (Gokhale, 
R. S., et al., (1999) Science 284, 482-485). Here we have extended and elaborated these 
findings in several significant ways. First, our results have vividly demonstrated the 
selectivity associated with linker-mediated chain transfer (Figure 6). Although 
intermodular chain transfer can occur between modules possessing mismatched linker 
pairs, a serious kinetic penalty is paid. In contrast, heterologous matched linker pairs can 
facilitate chain transfer between modules as efficiently as natural pairs. Second, the 
selectivity associated with linker pair interactions suggests that they might play a key role 
in the transient assembly of functionally paired complexes of the three DEBS proteins 
(Aparicio, J. F., et al., (1994) J. Biol. Chem. 269, 8524-8528; Pieper, R., et al., (1995) 
Nature 378, 263-266). In support of this hypothesis, we have demonstrated that a peptide 
mimic of the N-terminal linker of DEBS2 can inhibit chain transfer mediated by the 
linker pair at the DEBS1-DEBS2 interface but not by the linker pair at the DEBS2- 
DEBS3 interface (Figure 8). Third, we show that the N-terminal linker peptide of 
DEBS2 has significant (ca. 50%) helical content (Figure 9). This observation is 
consistent with secondary structure analyses of individual linker sequences, which 
previously indicated a propensity of these sequences to assume coiled-coil 
conformations. 2 It should be emphasized that, in order for a coiled-coil motif to facilitate 
noncovalent docking of two modules, the heterodimer (or heterooligomer) formed by 
associations between the C-terminal linker and N-terminal linker must be 
thermodynamically more favorable than either homodimer. Although the data shown in 
Figure 9 show helical content of a linker, they do not provide evidence for formation of a 
coiled-coil structure. Thus, direct evidence for the existence of coiled coils at 
intermodular interfaces, as well as their implications for selective chain transfer, remains 
to be obtained. 

[00130] Earlier studies have demonstrated the importance of two additional molecular 
recognition events in controlling the overall programming and specificity of PKSs. First, 
individual modules can discriminate among alternative incoming substrates (Wu, N., et 
al., (2000) J. Am. Chem. Soc. 122, 4847-4852); this selectivity appears to reside within 
the individual ketosynthase domains (Jacobsen, J. R., et al., (1997) Science 277, 367-369; 
Chuck, J., et al., (1997) Chem. Biol. 4, 757-766). Second, ketosynthase and ACP 
domains appear to have some degree of mutual recognition (Dreier, J., et al., (1999) J. 
Biol. Chem. 274, 25108-25112; Ranganathan, A., et al., (1999) Chem. Biol. 6, 731-741). 
Both of these recognition properties are localized within highly conserved and 
catalytically critical parts of the large PKS modules. Here we define and dissect a third 
element of selectivity. In contrast to the previously recognized factors influencing 
molecular recognition by PKS components, linker-mediated intermodular interactions 
have been localized to short, nonconserved regions that lie outside the core modules and 
have no influence on the intrinsic chemistry of the individual modules. 

Example 7 

Methods Directed Towards Assessing the Effects of Protein-Protein Interactions and 
Enzyme-Substrate Interactions in the Channeling of Intermediates between Polyketide 
Synthase Modules 

[00131] Construction of Plasmids. The gene encoding ACP4(4) was amplified as an 
NdeI-EcoRI PCR fragment (523 bp) using the primers 
5'-CCATATGGTGGTCGACCGGCTCG-3' and 5'-GAATTCCTACAGGTCCTCTCCCCC-3' 
(sequences complementary to DEBS shown in bold). The 
PCR product was cloned into pET28a (Novagen) to yield plasmid pNW8. Plasmid 
pST157 encodes a bimodular fusion between module 1 of DEBS1 and module 5 of 
DEBS3, with the thioesterase domain fused downstream of module 5 ("M1+M5+TE"). 
This fusion, which was engineered by taking advantage of the natural, conserved BsaBI 
sites located at the start of the KS domains of modules 2 and 5, also includes the loading 
didomain of DEBS1. The "linker" sequence that covalently bridges the fused modules is 
the natural sequence between modules 1 and 2, as in DEBS1. The fusion junction 
between module 5 and the thioesterase domain is identical to that in plasmid pRSG46. 23 
Similarly, plasmid pST92 encodes an "M1+M6+TE" bimodular fusion. Its construction, 
which is completely analogous to that of pST157, involves introduction of this bimodular 
PKS gene from pST96, Gokhale, et al., Science 1999, 284, 482-485, as an NdeI-EcoRI 
fragment into pET-21c (Novagen). The construction of genes encoding (5)M2+TE, (3)M3+TE, 
(5)M5+TE, and (5)M6+TE (pRSG64, pRSG64, pRSG46, and pRSG54, respectively) 
has been described previously, id., as well as the construction of a gene encoding 
(5)M3+TE (pST132). See Tsuji, et al., Biochemistry 2001. 
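
The cloning strategy for ACP4(4) relies on restriction sites carried by the two primers. As a rough illustration, the sketch below checks that the forward primer contains the NdeI recognition sequence (CATATG) and the reverse primer the EcoRI recognition sequence (GAATTC). The primer strings are transcribed from the paragraph above and may carry transcription errors; the helper names are hypothetical.

```python
NDEI = "CATATG"    # NdeI recognition sequence
ECORI = "GAATTC"   # EcoRI recognition sequence

# Primer sequences as given in the text (5' to 3')
FORWARD = "CCATATGGTGGTCGACCGGCTCG"
REVERSE = "GAATTCCTACAGGTCCTCTCCCCC"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def site_position(primer, site):
    """0-based index of the first occurrence of a site, or -1 if absent."""
    return primer.upper().find(site)


if __name__ == "__main__":
    print("NdeI site in forward primer at index", site_position(FORWARD, NDEI))
    print("EcoRI site in reverse primer at index", site_position(REVERSE, ECORI))
    print("reverse primer, coding-strand sense:", reverse_complement(REVERSE))
```

Read in the coding-strand sense, the reverse primer places a TAG triplet immediately upstream of the EcoRI site, which would be consistent with a stop codon, although the reading frame is not stated in the text.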

[00132] Expression and Purification of Proteins. All individual modules were 
expressed and purified as previously described, Wu, et al., J. Am. Chem. Soc. 2000, 122, 
4847-4852. The bimodular proteins were expressed as C-terminal His6-tagged fusion 
proteins, and their expression and purification schemes were identical to those previously 
described for the individual modules (id.), yielding 0.2 mg/L culture of purified 
M1+M5+TE and 1 mg/L culture of purified M1+M6+TE. ACP4(4) was expressed by 
transforming pNW8 into E. coli BL21(DE3) cells (Novagen), which were then grown in 
LB at 37 °C to an OD600 of 0.7-0.8. BL21(DE3)/pNW8 was induced overnight with 1 mM 
IPTG at 30 °C. The cells were harvested by centrifugation, washed with TE buffer, and 
then resuspended in disruption buffer (100 mM NaH2PO4 (pH 7.2), 100 mM NaCl, 1.2 
mM DTT, 1.2 mM EDTA, 0.7 mM benzamidine, 1 mg/L pepstatin, 1 mg/mL leupeptin, 
and 15% glycerol) before lysis by French press. Following removal of the cell debris by 
centrifugation, the supernatant was treated with 0.1% (w/v) PEI to remove nucleic acids 
followed by a 55% (NH4)2SO4 precipitation. The resulting (NH4)2SO4 pellet was 
resuspended in 100 mM NaH2PO4 (pH 7.2), 2.5 mM DTT, 1 mM EDTA, 20% glycerol 
(buffer A). This suspension was desalted on a PD-10 gel filtration column (Amersham 
Pharmacia Biotech AB) equilibrated with 10 mM imidazole in 50 mM Tris (pH 8.0), 1 M 
NaCl, 20% glycerol (buffer B), and the eluant was loaded at 1 mL/min onto a 
Flex-column (Kontes) packed with 5 mL of Ni-NTA Superflow resin (Qiagen) using a 
peristaltic pump. After being washed with 35 mM imidazole in buffer B for ACP4(4), 
the His6-tagged protein was eluted from the resin with 90 mM imidazole in buffer B. The 
appropriate fractions were concentrated, and the buffers were exchanged to buffer A + 
1.5 M (NH4)2SO4 in Centriprep 10 spin columns (Amicon). Using an Akta FPLC system 
(Amersham Pharmacia Biotech AB), the concentrated protein was loaded at 1 mL/min 
onto an XK 16/20 column packed with 30 mL of Phenyl Sepharose High Performance 
resin and equilibrated with the same buffer. A gradient from 750 mM (NH4)2SO4 to 0 
mM (NH4)2SO4 in buffer A was applied, which eluted the protein at 0 mM (NH4)2SO4. 
The appropriate fractions were concentrated in Centriprep 10 spin columns to yield 
approximately 10-15 mg/L of purified protein, which was flash frozen and stored at -80 
°C. The mass of apo-ACP4(4) was confirmed by MALDI-MS (calculated mass: 20492, 
observed mass: 20507). (MW - methionine) was also observed. 

[00133] Synthesis of CoA Thioester Diketides. The carboxylic acids of the diketides 
were synthesized as previously described. Harris, et al., J. Chem. Res. (S) 1998, 6, 283. 
They include the (2S,3R), (2R,3S), (2R,3R), and (2S,3S) diastereomers of 
2-methyl-3-hydroxy-pentanoic acid. These carboxylic acids were converted to CoA thioesters 5a-d 
under the following conditions. See Belshaw, et al., Science 1999, 284, 486-489; 
Robertson, et al., J. Am. Chem. Soc. 1991, 113, 2722-2729. Carboxylic acid (3.4 mg, 26 
μmol), CoASH (sodium salt, 1.1 equiv, Sigma), and PyBOP (1.5 equiv, Novabiochem) 
were dissolved in 0.39 mL of THF and 0.39 mL of 4% K2CO3 and stirred under argon for 
40 min. The reaction mixture was diluted up to 5 mL with H2O and injected onto a 
Beckman Ultrasphere C18 HPLC column (250 x 10 mm) equilibrated with 50 mM 
NaH2PO4 (pH 4.2) in 10% MeOH/H2O. Using a 10 mL/min linear gradient over 30 min 
to 50 mM NaH2PO4 (pH 4.2) in 80% MeOH/H2O, the CoA thioesters eluted at 55% 
MeOH. After removal of the MeOH on a rotavap, the product was desalted by 
reinjection on the same column equilibrated with 10% MeOH/H2O followed by elution 
with 90% MeOH. The product was lyophilized and verified by MALDI-MS (theoretical 
mass: 881.742; observed mass: 882.191) and 1H NMR (500 MHz) in H2O. 
5a: 0.71 (s, 3H), 0.85 (s, 3H), 0.86 (t, 3H), 1.08 (d, 3H), 1.48 (m, 2H), 2.38 (t, 2H), 2.76 (m, 1H), 
2.96 (t, 2H), 3.28 (t, 2H), 3.41 (t, 2H), 3.52 (dd, 1H), 3.73 (td, 1H), 3.79 (dd, 1H), 3.98 (s, 
1H), 4.20 (t, 2H), 4.55 (t, 1H), 4.79 (m, 2H), 6.13 (d, 1H), 8.21 (s, 1H), 8.51 (s, 1H). 
5b: 0.72 (s, 3H), 0.86 (s, 3H), 0.89 (t, 3H), 1.11 (d, 3H), 1.44 (m, 2H), 2.40 (t, 2H), 2.79 (m, 
1H), 2.97 (t, 2H), 3.30 (t, 2H), 3.43 (t, 2H), 3.52 (dd, 1H), 3.75 (td, 1H), 3.80 (dd, 1H), 
3.99 (s, 1H), 4.21 (m, 2H), 4.56 (m, 1H), 4.75 (m, 1H), 4.80 (m, 1H), 6.15 (d, 1H), 8.24 
(s, 1H), 8.54 (s, 1H). 
5c: 0.66 (s, 3H), 0.78 (s, 3H), 0.78 (t, 3H), 0.97 (d, 3H), 1.26 (m, 
1H), 1.48 (m, 1H), 2.31 (t, 2H), 2.71 (m, 1H), 2.89 (m, 2H), 3.22 (m, 2H), 3.34 (m, 2H), 
3.45 (dd, 1H), 3.59 (dt, 1H), 3.72 (dd, 1H), 3.90 (s, 1H), 4.14 (m, 2H), 4.49 (m, 1H), 4.73 
(m, 1H), 4.84 (m, 1H), 6.09 (d, 1H), 8.25 (s, 1H), 8.50 (s, 1H). 
5d: 0.66 (s, 3H), 0.78 (s, 3H), 0.78 (t, 3H), 0.97 (d, 3H), 1.27 (m, 1H), 1.48 (m, 1H), 2.31 (t, 2H), 
2.71 (m, 1H), 2.89 (m, 2H), 3.22 (m, 2H), 3.34 (t, 2H), 3.46 (dd, 1H), 3.59 (m, 1H), 3.73 (dd, 1H), 
3.90 (s, 1H), 4.14 (m, 2H), 4.49 (m, 1H), 4.75 (m, 1H), 4.82 (m, 1H), 6.09 (d, 1H), 8.25 (s, 
1H), 8.50 (s, 1H). Concentrations of solutions of CoA thioesters were determined by 
A260 measurement and calibration against known CoA concentration standards. Yield: 
9.6 μmol (37%). 
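
The quantities in this preparation can be cross-checked with simple stoichiometry. The sketch below assumes a molecular weight of 132.16 g/mol for 2-methyl-3-hydroxypentanoic acid (C6H12O3); the function names are hypothetical and the calculation is only a consistency check on the stated figures, not part of the described procedure.

```python
ACID_MW = 132.16  # g/mol for C6H12O3 (assumed value, computed from the formula)


def micromoles(mass_mg, mw_g_per_mol):
    """Convert a mass in mg to an amount in micromoles."""
    return mass_mg / mw_g_per_mol * 1000.0


def reagent_amounts(limiting_umol, equivalents):
    """Amount (umol) of each reagent given its molar equivalents."""
    return {name: limiting_umol * eq for name, eq in equivalents.items()}


def isolated_amount(limiting_umol, fractional_yield):
    """Product amount (umol) for a given fractional yield."""
    return limiting_umol * fractional_yield


if __name__ == "__main__":
    acid = micromoles(3.4, ACID_MW)
    print(f"acid: {acid:.1f} umol")
    print(reagent_amounts(26.0, {"CoASH": 1.1, "PyBOP": 1.5}))
    print(f"37% of 26 umol: {isolated_amount(26.0, 0.37):.1f} umol")
```

On these numbers, 3.4 mg of the acid corresponds to about 26 μmol, and a 37% yield from 26 μmol corresponds to about 9.6 μmol of CoA thioester.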

[00134] Formation of Holo-ACP and Acyl-ACP from Apo-ACP. The 
phosphopantetheinylation reactions were catalyzed by the Sfp phosphopantetheine 
transferase, see Quadri, et al., Biochemistry 1998, 37, 1585-1595; Weinreb, et al., 
Biochemistry 1998, 37, 1575-1584, under the following conditions: 150 μM apo-ACP, 4 
equiv CoASH (lithium salt, Sigma) or acyl-CoA 5a-d, 0.2 equiv Sfp in 100 mM 
NaH2PO4 (pH 6.6), 10 mM MgCl2, 2.5 mM DTT, 20% glycerol at 37 °C for 20 min. 
Excess small molecules and Sfp were removed from the phosphopantetheinylated ACPs 
by applying the reaction mixture with an Akta FPLC system to a 6 mL Resource Q 
column (Amersham Pharmacia Biotech AB) and eluting with a linear gradient from 0 
mM NaCl to 500 mM NaCl in buffer A. The desired proteins eluted at 220 mM NaCl 
and were concentrated using Centriprep 10 spin columns. Protein concentrations were 
determined using a modified Lowry assay (Sigma), and the masses were confirmed by 
MALDI-MS or +ESI-MS (4a: observed mass = 20945, calculated mass = 20944; 4b: 
observed mass = 20964; 4c: observed mass = 21056; 4d: observed mass = 20992). 

[00135] Qualitative Substrate Incorporation Assays. The reaction buffer for the 
diketide incorporation assays contained 400 mM NaH2PO4 (pH 7.2), 2.5 mM DTT, 1 mM 
EDTA, 20% glycerol (reaction buffer C). Module (1 μM), acyl-ACP (20 μM), 
DL-[2-14C]-methylmalonyl CoA (500 μM; ARC), and NADPH (4 mM; Sigma) were incubated in 20 
μL of the reaction buffer at 30 °C for 1.5 h. The reactions were either spotted directly 
onto a TLC plate (Whatman 250 μm silica gel, UV254), or first extracted with EtOAc 
followed by spotting of the organic extracts on a TLC plate. The TLC plates were 
resolved using 60% EtOAc in hexanes, and the radioactive products were visualized on a 
Packard InstantImager. 

[00136] Verification of Reaction Products. Triketide lactone products 3a and 3b 
derived from 2a (or 4a) and 2b (or 4b), respectively, have been previously verified.26 To 
verify the triketide lactone products 3c and 3d, reaction extracts were purified by 
preparative TLC. The ethyl acetate extracts of the spots corresponding to the triketide 
lactones were concentrated and then derivatized to TMS ethers by incubation with 50 μL 
N,O-bis-(trimethylsilyl)trifluoroacetamide (Aldrich) for 30 min at room temperature. 
See McPherson, et al., J. Am. Chem. Soc. 1998, 120, 3267-3268. Injection of the sample 
onto a GC-MS yielded fragmentation peaks at molecular weights 73 and 171, 
corresponding to cleavage between the oxygen and silicon atoms, as expected. Mass 
spectral confirmation data of the β-ketolactone equivalents of 3c and 3d were obtained 
without derivatization and by ESI-MS. The elution pattern of the triketide lactones from a 
chiral HPLC column is described below. 

[00137] Determination of kcat Values. The assays for kinetic measurements were 
performed in reaction buffer C and with the same concentrations of NADPH and 
14C-methylmalonyl CoA as for the qualitative assays. Saturating concentrations of 
propionyl-CoA were added to, and the ACP substrates were excluded from, the bimodular reactions. 
To quench the reactions, 20 μL reaction aliquots were mixed with 80 μL of 12.5% SDS. 
kcat values for the acyl-ACP substrates were determined by measuring steady-state 
saturating rates at multiple substrate concentrations (varying from 40 to 90 μM 4). For 
reactions that did not saturate by 90 μM of substrate, the kcat values are reported as lower 
limits. Workup and visualization of the reaction products were identical to those for the 
qualitative assays. 
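
Reporting a lower limit follows from Michaelis-Menten kinetics: because v/[E] = kcat[S]/(KM + [S]) is always below kcat, the largest observed turnover rate bounds kcat from below. The sketch below shows one possible way to encode that decision. The 10% plateau tolerance, the function name, and the example data are all assumptions for illustration; the text does not specify the criterion that was used to judge saturation.

```python
def estimate_kcat(measurements, tolerance=0.10):
    """Return (value, is_lower_limit) from steady-state turnover rates.

    measurements: iterable of (substrate_uM, rate_per_min) pairs, where the
    rate is already normalized to enzyme concentration (v / [E]).
    The rates are treated as saturated when the two highest substrate
    concentrations give rates within `tolerance` of each other (an assumed
    criterion); otherwise the largest rate is only a lower limit on kcat.
    """
    ordered = sorted(measurements)
    if len(ordered) < 2:
        raise ValueError("need at least two substrate concentrations")
    v_prev = ordered[-2][1]
    v_top = ordered[-1][1]
    best = max(rate for _, rate in ordered)
    saturated = v_top > 0 and abs(v_top - v_prev) / v_top <= tolerance
    return best, not saturated


if __name__ == "__main__":
    # Hypothetical data sets spanning the 40-90 uM range used in the assays
    print(estimate_kcat([(40, 2.00), (60, 2.05), (90, 2.10)]))
    print(estimate_kcat([(40, 0.50), (60, 0.75), (90, 1.10)]))
```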

[00138] Determination of kcat/KM Values. The assays for determination of 
(kcat/KM)rel were performed with two competing substrates in the same reaction under the 
same conditions as described above for the qualitative assays, except the reaction 
volumes were doubled to 40 μL. The data were fit into the equation where SA and SB are 
the two competing substrates and PA and PB are the corresponding products derived from 
SA and SB, respectively. The unknown, absolute kcat/KM values could then be obtained 
from known, absolute kcat/KM data that had been derived directly from the initial slopes of 
v versus [S] plots. See McPherson, et al., J. Am. Chem. Soc. 1998, 120, 3267-3268. Each 
reaction was done in duplicate at two different ratios of substrate concentrations. The 
reactions were quenched with 120 μL of 12.5% SDS, and the products were extracted 
with 2 x 300 μL of EtOAc. The organic extracts were purged of highly polar compounds 
as well as particulates by flash chromatography through 50 μL of silica gel in a 1-mL 
polypropylene pipet attached to a 3-mm, 0.22-μm nylon syringe filter (Osmonics, Inc.), 
eluting with 1.5 mL of EtOAc. Following removal of the organic solvents, the residual 
extracts were resuspended in 20 μL of hexane and loaded onto a 250 x 4.6 mm Chiralpak 
AS column with the corresponding guard column (Daicel Chemical Industries) that had 
been equilibrated with 5% EtOH (Reagent Alcohol, Fisher) in hexane. With a flow rate 
of 0.8 mL/min, the products were separated using a 20 min gradient (starting at 2 min) 
from 5 to 15% EtOH in hexane. The reduced triketide lactone products 3a-d eluted at 
20.0, 17.0, 21.5, and 18.5 min, respectively. The unreduced triketide lactone products, 
derived from 4c and 4d, eluted at 21.0 and 19.0 min, respectively. The appropriate 
fractions were collected, and the radioactive products were detected and quantified using 
Formula-989 liquid scintillation cocktail fluid (Packard) on a Beckman LS3801 liquid 
scintillation counter. 
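
The fitted equation itself is not reproduced in the paragraph above. As an illustration only, the sketch below assumes the standard relation for two substrates competing for the same enzyme, d[PA]/d[PB] = (kcat/KM)A[SA] / ((kcat/KM)B[SB]), integrated over the course of the reaction. This may differ in form from the equation the authors used, and the function names and example numbers are hypothetical.

```python
import math


def relative_specificity(p_a, p_b, s_a0, s_b0):
    """(kcat/KM)_A / (kcat/KM)_B from one competition reaction.

    p_a, p_b: product concentrations formed from substrates A and B.
    s_a0, s_b0: initial substrate concentrations (same units as products).
    Uses the integrated competition relation
        ln(1 - P_A/S_A0) / ln(1 - P_B/S_B0),
    which reduces to (P_A/P_B) * (S_B0/S_A0) at low conversion.
    """
    frac_a = p_a / s_a0
    frac_b = p_b / s_b0
    if not (0.0 < frac_a < 1.0 and 0.0 < frac_b < 1.0):
        raise ValueError("conversions must lie strictly between 0 and 1")
    return math.log(1.0 - frac_a) / math.log(1.0 - frac_b)


def absolute_specificity(rel_a_over_b, known_b):
    """Scale a relative value by the known absolute kcat/KM of substrate B."""
    return rel_a_over_b * known_b


if __name__ == "__main__":
    # Hypothetical product and substrate concentrations (uM)
    rel = relative_specificity(p_a=2.0, p_b=1.5, s_a0=10.0, s_b0=20.0)
    print(f"(kcat/KM)rel = {rel:.2f}")
```

Because both substrates see the same free enzyme in a single reaction, the ratio is independent of enzyme concentration and incubation time, which is one reason a competition format suits substrates that cannot be taken to saturation.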

[00139] Labeling of Holo-ACP4(4) with 14C-2a Mediated by (5)M2+TE. 
Holo-ACP4(4) (20 μM) was incubated with 1 mM [1-14C]-labeled 2a (custom synthesized by 
Amersham Pharmacia, specific activity 55 mCi/mmol) and 1 μM (5)M2+TE in reaction 
buffer C for 10 min at 30 °C. The protein was precipitated with 75% acetone/H2O for 5 
min at -80 °C. After washing the pellet with 6.25% (w/v) TCA to remove excess salts 
followed by 500 μL of 75% acetone/H2O to remove residual, unbound 14C-2a, the 
precipitated protein was resuspended in 8 μL of buffer A and 4 μL of SDS sample buffer, 
and resolved on a 4-20% SDS-PAGE gradient gel (Bio-Rad). The proteins were 
visualized with Coomassie blue stain and dried, and the radioactivity was detected either 
on a Packard InstantImager or by exposing the gel to X-ray film. 

[00140] Nomenclature. The nomenclature used in this report for proteins containing 
linker regions is similar to that adopted in a previous publication (id.). Specifically, the 
module of origin of the linker is placed in parentheses either before or after the name of 
the domain or module to which it is attached, depending on whether it is an N- or a C- 
terminal linker, respectively. For example, a fusion protein comprising the ACP domain 
from module 2 of DEBS and the C-terminal linker of module 4 is referred to as ACP2(4); 
likewise, a protein comprising the N-terminal linker of module 5 fused to module 6 is 
referred to as (5)M6. 
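
The naming convention can be made concrete with a small parser. The sketch below is illustrative only: the class and function names are hypothetical, and the matching rule (a C-terminal linker from module k pairs with an N-terminal linker from module k+1, with unannotated constructs carrying their own natural linkers) is an assumption inferred from the DEBS module 2/3 and module 4/5 interfaces discussed in the text, not a general rule stated by the authors.

```python
import re
from dataclasses import dataclass
from typing import Optional

# Optional "(n)" prefix, domain/module name, optional "(c)" suffix, optional "+TE"
_NAME = re.compile(r"^(?:\((\d+)\))?(ACP|M)(\d+)(?:\((\d+)\))?(\+TE)?$")


@dataclass(frozen=True)
class Construct:
    kind: str                 # "ACP" or "M"
    number: int               # module of origin of the domain/module
    n_linker: Optional[int]   # module of origin of the N-terminal linker
    c_linker: Optional[int]   # module of origin of the C-terminal linker
    has_te: bool              # thioesterase fused downstream


def parse_construct(name: str) -> Construct:
    """Parse names such as 'ACP2(4)', '(5)M6', or '(5)M3 + TE'."""
    match = _NAME.match(name.replace(" ", ""))
    if match is None:
        raise ValueError(f"unrecognized construct name: {name!r}")
    n, kind, num, c, te = match.groups()
    return Construct(kind, int(num),
                     int(n) if n else None,
                     int(c) if c else None,
                     te is not None)


def linkers_matched(donor: Construct, acceptor: Construct) -> bool:
    """Assumed rule: donor C-linker from module k matches acceptor N-linker from k+1."""
    c = donor.c_linker if donor.c_linker is not None else donor.number
    n = acceptor.n_linker if acceptor.n_linker is not None else acceptor.number
    return n == c + 1


if __name__ == "__main__":
    for donor, acceptor in [("M2", "M3+TE"), ("M2(4)", "(5)M3+TE"),
                            ("M2", "(5)M3+TE"), ("ACP4(4)", "(3)M3+TE")]:
        ok = linkers_matched(parse_construct(donor), parse_construct(acceptor))
        print(f"{donor} -> {acceptor}: {'matched' if ok else 'mismatched'}")
```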

[00141] Construction and Expression of Bimodular Enzymes. Analogous to 
DEBS1+TE described earlier, Cortes, et al., Science 1995, 268, 1487-1489; Kao, et al., J. 
Am. Chem. Soc. 1995, 117, 9105-9106, M1+M5+TE (module 1 + module 5 + TE) and 
M1+M6+TE are heterologous fusions of DEBS module 1 with DEBS modules 5 and 6, 
respectively. The natural linker between modules 1 and 2 in the wild-type DEBS1 
protein was preserved in each case. In addition, the DEBS thioesterase (TE) domain was 
fused to the C terminus of each downstream module to facilitate turnover by catalyzing the 
release of the triketide product. These two proteins were expressed as C-terminally 
His6-tagged proteins and purified on a hydrophobic butyl sepharose column followed by 
Resource Q ion-exchange chromatography to yield approximately 0.2 mg/L culture of 
purified M1+M5+TE and 1 mg/L culture of purified M1+M6+TE. 

[00142] Kinetic Analysis of Bimodular Constructs. In earlier studies on the kinetic 
properties of individual modules (Wu, et al., J. Am. Chem. Soc. 2000, 122, 4847-4852), 
substrates were diffusively presented to the KS domain of each module as free N- 
acetylcysteamine (NAC) thioesters. This can be contrasted with the natural mode of 
chain transfer in a multimodular system, where acyl chains arrive at the KS domain via 
direct transfer from an upstream ACP domain (Figure 4). To explore whether the latter 
mode of substrate incorporation can have kinetic benefits over the former, the rates of 
triketide lactone 3a synthesis by M1+M5+TE and M1+M6+TE were measured in the 
presence of saturating concentrations of propionyl-CoA, methylmalonyl-CoA, and 
NADPH. The kcat values for these two hybrid PKSs were determined to be 3.1 ± 0.1 and 
4.1 ± 0.4 min⁻¹, respectively (Figure 12). These parameters compare well with the 
maximal rate of 4.8 min⁻¹ for DEBS1+TE (Pieper, et al., Biochemistry 1997, 36, 
1846-1851). In contrast, we have shown earlier that whereas module 2+TE and module 6+TE 
turn 2a over with comparable rate constants (kcat = 4.6 and 17 min⁻¹, respectively), 
module 5+TE is a significantly weaker catalyst for the same reaction (kcat = 0.25 min⁻¹). 
Addition of exogenous 2a to the reaction catalyzed by the bimodular proteins had no 
effect on their overall catalytic rates. The implications of these results to intramodular 
substrate channeling will be evaluated in the discussion section. 
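
The comparison drawn in this paragraph can be expressed as a ratio of turnover numbers. The sketch below uses the kcat values quoted above for module 5; the function name is hypothetical, and the ratio is only a descriptive figure, since the rate-limiting step need not be the same in the bimodular and the diffusively primed reactions.

```python
def fold_change(channeled_kcat, diffusive_kcat):
    """Ratio of kcat with an upstream module attached to kcat with NAC thioester priming."""
    if diffusive_kcat <= 0:
        raise ValueError("diffusive kcat must be positive")
    return channeled_kcat / diffusive_kcat


if __name__ == "__main__":
    # kcat values (min^-1) quoted in the text for module 5
    m1_m5_te = 3.1    # M1+M5+TE, substrate delivered by module 1
    m5_te_nac = 0.25  # module 5+TE primed diffusively with 2a
    print(f"module 5: {fold_change(m1_m5_te, m5_te_nac):.1f}-fold")
```

For module 5, covalent attachment of module 1 corresponds to a roughly 12-fold higher turnover number than diffusive priming with 2a.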

[00143] Construction and Expression of Individual ACPs. ACP4(4) includes the 
entire DEBS ACP4 catalytic domain with its natural C-terminal linker. (The ACP linker 
is defined as the residues between the ACP consensus sequence and the C terminus of the 
polypeptide. See Tsuji, et al., Biochemistry 2001.) This gene was expressed as a 20.5 
kDa N-terminally His6-tagged protein to preserve the natural sequence of the C-terminal 
linker. ACP4(4) was purified by affinity chromatography on a nickel column followed 
by a hydrophobic phenyl sepharose column to yield approximately 10-15 mg/L culture of 
purified apoprotein. 

[00144] Chemoenzymatic Synthesis of Acyl-ACPs. Preparations of the CoA 
thioesters of the natural diketide substrate of module 2, its enantiomer, its C-3 epimer, 
and its C-2 epimer (Figure 11) were carried out as described in the Materials and Methods 
section. Phosphopantetheinylation of apo-ACP4(4) was catalyzed by Sfp (Quadri, et al., 
Biochemistry 1998, 37, 1585-1595; Weinreb, et al., Biochemistry 1998, 37, 1575-1584) 
(Figure 10C, first reaction) to generate acyl-ACP4(4) adducts 4a-d. Purification of these 
acyl-ACPs, as described in the Materials and Methods section, led to >95% pure 
materials as judged from SDS-PAGE. Complete phosphopantetheinylation was verified 
by MALDI-MS or ESI-MS. 



[00145] Qualitative Assays of Diketide Incorporation by Acyl-ACPs. The 
acyl-ACP4(4) adducts 4a-d were incubated individually with (5)M2+TE, (5)M5+TE, and 
(5)M6+TE in the presence of saturating concentrations of 14C-methylmalonyl CoA 
extender unit and NADPH. For a given acyl-ACP, the products from modules 2+TE, 
5+TE, and 6+TE were expected to be identical (Figure 11), since the modules catalyze 
the same set of reactions with identical stereocontrol (albeit normally on very different 
natural substrates). Both 4a and 4b were accepted and extended by all three modules. 
Likewise, the corresponding NAC-thioesters 2a and 2b have been shown to be substrates 
for the three modules.26 Remarkably, however, 4c and 4d were also substrates for the 
three modules, even though no turnover of the corresponding NAC thioesters 2c and 2d 
was detected in the case of any module, Wu, et al., J. Am. Chem. Soc. 2000, 122, 4847-4852. 
(It should be noted that elongation of 4c and 4d by modules 5+TE and 6+TE yielded 
minor quantities of unreduced triketide lactones, indicating less efficient β-ketoreductase 
activity on these two anti-diketide substrates than on the two syn-diketide substrates.) 
Consistent with previous linker studies,23,27 while all four acyl-ACP adducts were 
observed to be substrates for (5)M3+TE, no product formation was observed from the 
incubation of any of the acyl-ACP adducts with (3)M3+TE, even though 2a and 2b have 
previously been shown to be readily incorporated and elongated when presented to either 
module 3+TE derivative, Wu, et al., J. Am. Chem. Soc. 2000, 122, 4847-4852; Tsuji, et al., 
Biochemistry 2001. Thus, matched linker pairs appear to be capable of enhancing the 
efficiency with which otherwise poor substrates can be channeled between modules. 
Conversely, mismatched linkers can present a major barrier to the channeling of 
otherwise acceptable substrates between modules. Control studies performed with 2a and 
5a showed that the two compounds are approximately equivalent substrates for the same 
modules (data not shown). From the amount of product detected in these ACP-mediated 
reactions, the efficiency of the PKS-catalyzed reaction could be estimated. Under typical 
assay conditions (20 μM 4 and 1 μM (5)M5+TE), 70% of the acyl-ACP was converted 
into triketide lactone in 1 h. Two conclusions can be drawn from this result. First, 
acyl-ACPs are significantly superior substrates to acyl-NAC thioesters. (Typically, millimolar 
concentrations of the NAC thioester must be used to detect comparable amounts of 
product under otherwise similar assay conditions.) Second, the assay system described in 
Figure 11 allows for monitoring multiple turnovers of the enzyme. Indeed, as described 
below, in all cases the maximum rates of consumption of the acyl-ACP substrates were 
comparable to or higher than the maximum rates of consumption of their NAC thioester 
counterparts. Therefore, the association of the donor ACP and the acceptor 
module must be transient, and the dissociation rate constant of the ACP from the module 
must be significantly faster than the slowest step in the module-catalyzed elongation. 
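
The claim of multiple turnovers follows directly from the assay concentrations quoted above (20 μM acyl-ACP, 1 μM module, 70% conversion in 1 h). The sketch below works through that arithmetic; the function names are hypothetical, and the average rate it reports is a time-averaged figure rather than a kcat.

```python
def turnovers_per_enzyme(substrate_uM, fraction_converted, enzyme_uM):
    """Number of substrate molecules converted per enzyme molecule."""
    return substrate_uM * fraction_converted / enzyme_uM


def average_rate_per_min(turnovers, minutes):
    """Time-averaged turnover rate over the incubation (not a kcat)."""
    return turnovers / minutes


if __name__ == "__main__":
    n = turnovers_per_enzyme(substrate_uM=20.0, fraction_converted=0.70, enzyme_uM=1.0)
    print(f"turnovers per module: {n:.0f}")
    print(f"average rate: {average_rate_per_min(n, 60.0):.2f} per min")
```

Each module therefore processes about 14 acyl-ACP molecules during the incubation, so every donor ACP must dissociate after transfer to make room for the next.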



[00146] Kinetic Analysis of Incorporation of Diketides from Acyl-ACPs. The 
kcat/KM values for the reactions of 4a and 4b with (5)M2+TE, (5)M5+TE, and (5)M6+TE 
are shown in Figure 13. Since full saturation curves could not be obtained for the ACP- 
bound substrates for technical reasons, the kcat/KM value for 4b was derived by 
competitive assay against 2a (whose absolute kcat/KM value was derived from the initial 
slope of the v versus [S] saturation curve). Likewise, the kcat/KM value for 4a was 
derived by competitive assay against 4b. Several observations are noteworthy regarding 
the data summarized in Figure 13. First, 2a and 4a are significantly better substrates than 
2b and 4b for each module. In addition, the improvement in specificity for an acyl-ACP 
adduct over its NAC thioester counterpart is particularly pronounced in cases where the 
NAC thioesters are exceptionally poor (e.g., module 2+TE-catalyzed elongation of 4b 
versus 2b, module 5+TE-catalyzed elongation of 4a versus 2a, and especially module 
5+TE-catalyzed elongation of 4b versus 2b). The implications of these observations are 
elaborated in the Discussion section. 

[00147] To quantify the kinetic advantage of channeling in the above assay system, the 
kcat values for the reactions of 4a-d with modules 2+TE, 5+TE, and 6+TE were measured 
(Figure 14). Kinetic measurements were performed at substrate concentrations between 
40 and 90 μM of each acyl-ACP substrate. For reactions that yielded measurable 
quantities of unreduced triketide lactone products (i.e., elongation of 4c and 4d by 
modules 5+TE and 6+TE), both reduced and unreduced products were combined and 
included for calculations of the kinetic parameters. Except for the reaction of 4a with 
module 2+TE, none of the substrates saturate the enzymes in this concentration range, 
and thus, the kcat values are reported as lower bounds. Comparison of the kcat values of 
the acyl-ACP forms of the four diastereomeric diketides with those from the 
corresponding reactions involving NAC thioester substrates (which are also shown in 
Figure 14) (Wu, et al., J. Am. Chem. Soc. 2000, 122, 4847-4852) affords two interesting 
observations. First, in those cases where the NAC thioester is a reasonably good 
substrate (e.g., module 2+TE- or module 6+TE-catalyzed elongation of 2a and 4a), the 
maximal reaction rates are comparable regardless of whether the acyl chain is bound to 
NAC or an ACP with a matched linker. In contrast, where the NAC thioester is an 
inferior substrate (e.g., module 2+TE-catalyzed elongation of 2b and 4b, module 
5+TE-catalyzed elongation of 2a and 4a, module 5+TE-catalyzed elongation of 2b and 4b, and 
elongation of 4c and 4d by any of the modules), tethering the same acyl chain to an ACP 
with a matched linker can result in significant improvements in kcat. Second, the maximal 
rate of turnover of 2a and 4a is significantly greater than that of 2b and 4b for all tested 
modules. Again, the implications of these findings are discussed below. 

[00148] Investigation of the Reversibility of the Donor ACP to Acceptor KS 
Transfer Reaction. Ordinarily, the flow of intermediates in a metabolically active PKS 
is vectorial. A possible mechanism for such directionality could be that, once an acceptor 
KS is acylated with the incoming chain, conformational changes in the module prevent 
the pantetheine arm of the donor ACP from accessing the active site again. To test 
whether this may be the case, holo-ACP4(4) was incubated with 14C-2a in the presence 
and absence of (5)M2+TE (Figure 15). The ACP was radiolabeled only in the reaction 
containing (5)M2+TE. (As expected, (5)M2+TE was also labeled.) When 
methylmalonyl CoA extender units and NADPH were added to induce catalytic activity 
of the module, (5)M2+TE-dependent labeling of ACP4(4) was also observed, but the 
degree of labeling was considerably reduced. Thus, there does not appear to be an 
absolute barrier to the back-transfer of an acyl chain from a KS to the ACP of the 
preceding module. 

[00149] We have previously investigated the substrate specificity of individual 
modules of DEBS using diketide substrates activated as N-acetylcysteamine (NAC) 
thioesters (Figure 10A) (id.). The diketides included in the previous study were the four 
diastereomeric forms of the natural substrate for module 2 (Figure 11). These substrates 
were assayed against DEBS modules 2, 3, 5, and 6, each with a TE domain fused to the 
C terminus to facilitate turnover, and steady-state kinetic parameters were determined for 
each substrate-enzyme combination. The substrate specificity profiles (as reflected in the 
kcat/KM values) for the four enzymes were found to be remarkably similar in that all of the 
modules preferred 2a over 2b, and, within detection limits, neither of the two 
anti-diketides 2c or 2d was observed to be a substrate for any of the modules. 

[00150] The preference of 2a over its enantiomer 2b for all modules was especially 
intriguing in light of the fact that the natural substrates for modules 3 and 6 share more 
structural similarities to 2b than to 2a. One explanation for this discrepancy was that the 
NAC thioester-based assay system (Figure 10A) may not entirely represent the 
mechanism of acylation of a multimodular system. While NAC thioester substrates 
must be loaded diffusively onto the KS of a module (Figure 10A), polyketide 
intermediates are channeled from the ACP of one module to the KS of the downstream 
module via covalent transfer. Substrate channeling in a multimodular system can occur 
either between two modules on the same polypeptide (e.g., between modules 1 and 2; 
Figure 10B), or between two modules on separate polypeptides (e.g., between modules 2 
and 3; Figure 10C). Further evidence that protein-protein interactions may influence the 
substrate specificities of individual modules emerged from previous experiments 
suggesting that interpolypeptide linkers, defined as the highly variable regions outside 
the consensus sequences of the modules, are involved in mediating selective 
intermodular chain transfer, Gokhale, et al., Science 1999, 284, 482-485; Tsuji, et al., 
Biochemistry 2001. To investigate the balance of protein-protein interactions and 
enzyme-substrate interactions in controlling polyketide chain elongation, two assay 
systems that take intermodular interactions into account were used in this study. 

[00151] Kinetic Channeling in Intrapolypeptide Chain Transfer. In the first 
system (Figure 10B), the effect of substrate channeling between two modules within the 
same polypeptide was investigated. More specifically, in the context of the bimodular 
constructs M1+M2+TE (DEBS1+TE), M1+M5+TE, and M1+M6+TE, DEBS modules 2, 
5, and 6 were examined for their abilities to accept and elongate the natural diketide 
intermediate that was passed from a covalently attached module 1. The turnover number
of M1+M5+TE was comparable to that of M1+M6+TE or the "wild-type" M1+M2+TE.
In contrast, when primed diffusively by 2a, the maximum catalytic rate of module 5+TE
is significantly reduced compared to that of module 2+TE or module 6+TE. This
disparity indicates that covalent connection of modules can have a beneficial kinetic
effect and hints that, in addition to physical channeling of intermediates, multimodular
PKSs are also capable of kinetic channeling of intermediates. However, since the only
incoming substrate that could be tested using this assay system was the natural diketide, a 
better assay system was needed to explore the role of kinetic channeling more generally. 

[00152] Kinetic Channeling in Interpolypeptide Chain Transfer. The minimal 
donor protein requirement for substrate channeling to an acceptor module was postulated 
to be an ACP domain with an appropriate C-terminal linker. Therefore, we constructed, 
expressed, and purified the ACP4 domain and its natural C-terminal linker as an 
individual polypeptide. A variety of acyl groups were then covalently attached to the
phosphopantetheine arm of holo-ACP4(4) via a chemoenzymatic procedure (Figure 10C,
first reaction). The resulting acyl-ACP4(4) adducts 4a-d were tested for their ability to 
transfer the attached diketides from ACP4(4) to the KS of an acceptor module (Figure 
10C, second reaction), where they could then undergo standard chain elongation to yield 
a triketide lactone (Figure 10C, third reaction). The small size of the ACP4(4) protein, 
together with high expression levels of soluble protein in Escherichia coli, allowed 
production of reagent quantities of this protein for use as a substrate in multiple turnover 
assays. The requirement of both an ACP and the linker was highlighted by the fact that 
corresponding CoA thioesters exhibited comparable kinetic parameters and that 
mismatched linkers led to a dramatic reduction in turnover efficiency in the case of 
module 3+TE. The latter feature is consistent with the linker hypothesis developed 
earlier. Id. Although the precise KM values for individual acyl-ACP substrates could not
be measured in many cases, in all cases they were estimated to be approximately 2-3
orders of magnitude lower than the reported KM values for the corresponding NAC
thioester reactions (micromolar for acyl-ACPs versus millimolar for acyl-S-NACs), thus
making acyl-ACPs excellent substrates for individual PKS modules.
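
To see why a KM that is 2-3 orders of magnitude lower matters in practice, the following Python sketch computes the Michaelis-Menten fractional saturation, [S] / (KM + [S]), for two substrates offered at the same concentration. It is only an illustration under stated assumptions: the function name, the substrate concentration, and both KM values are hypothetical round numbers chosen to fall in the micromolar and millimolar ranges mentioned above, not measured parameters.

```python
def fractional_saturation(substrate_conc: float, km: float) -> float:
    """Michaelis-Menten fractional saturation v/Vmax = [S] / (KM + [S]).

    Both arguments must be expressed in the same concentration unit.
    """
    if substrate_conc < 0 or km <= 0:
        raise ValueError("concentration must be non-negative and KM positive")
    return substrate_conc / (km + substrate_conc)


# Hypothetical illustration values in molar units; NOT taken from the text.
SUBSTRATE_CONC = 50e-6    # 50 micromolar substrate
KM_ACYL_ACP = 5e-6        # a micromolar-range KM (acyl-ACP-like)
KM_NAC_THIOESTER = 5e-3   # a millimolar-range KM (NAC-thioester-like)

sat_acyl_acp = fractional_saturation(SUBSTRATE_CONC, KM_ACYL_ACP)
sat_nac = fractional_saturation(SUBSTRATE_CONC, KM_NAC_THIOESTER)

print(f"acyl-ACP-like substrate:      {sat_acyl_acp:.1%} of Vmax")
print(f"NAC-thioester-like substrate: {sat_nac:.1%} of Vmax")
```

With these assumed numbers, the low-KM substrate operates close to saturation while the high-KM substrate reaches only about one percent of its maximal rate at the same concentration.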

[00153] Implications of the Interpolypeptide Transfer Kinetics Data. The
establishment of the acyl-ACP-based assay system allowed us to address two important
questions regarding the relative balance of protein-protein interactions and enzyme- 
substrate interactions in multimodular systems. First, is the universal preference among 
the three tested modules for 2a over 2b preserved when the same substrates are delivered 
as acyl-ACP adducts? And second, under saturation conditions, can kinetic channeling of 
these diketide substrates be observed for any module? 

[00154] As seen in Figure 13, the preference for the (2S,3R)-diastereomer over its
enantiomeric (2R,3S)-diastereomer by all modules is preserved regardless of whether the
substrate is loaded by a channeling mechanism or by a diffusive mechanism. This 
conserved preference suggests that the catalytic steps in a given module that discriminate 
between different substrates remain unchanged whether modules are primed diffusively 
or by a channeling mechanism and that for module 2, at least, the most likely source of
discrimination is the KS acylation step. Furthermore, the turnover numbers of the
individual enzyme-catalyzed reactions reported in Figure 14 make a strong case for 
kinetic channeling in multimodular PKSs. When saturating concentrations of 2b are
co-incubated with M5+TE in the presence of methylmalonyl-CoA and NADPH, the
elongation rate constant is only 0.02 min⁻¹. In contrast, when the same reaction is
monitored using 4b as a substrate, the elongation rate constant (kcat) increases at least
25-fold. Similarly, a maximal rate increase of greater than 40-fold is observed in the
elongation of 4a versus 2a by M5+TE. The effect may be even more pronounced for the
two anti-diketides, whose elongation rates are below detectable limits (<0.01 min⁻¹) when
presented as NAC thioesters 2c and 2d, but are quite respectable (~1 min⁻¹) when
presented as acyl-ACP adducts 4c and 4d. Of course, if the KM values for the two anti- 
diketides when presented as NAC-thioesters are significantly higher than the solubility- 
limited concentrations that were used in the assay, then the apparent kinetic advantage of 
channeling the anti-diketides could be artificially high. Even so, these overall results 
indicate that channeling dramatically increases the efficacy of poor substrates. In 
addition, the results reported here provide insight into how multimodular PKSs can be so
remarkably tolerant toward protein engineering, even though individual modules are 
fairly specific catalysts. 
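
The fold enhancements discussed in this paragraph follow from simple ratios of elongation rate constants. The Python sketch below works through that arithmetic using only the figures quoted above for M5+TE (0.02 min⁻¹ for 2b, "at least 25-fold" for 4b, a detection limit of 0.01 min⁻¹ for 2c and 2d, and roughly 1 min⁻¹ for 4c and 4d). The function and variable names are illustrative assumptions, and the results are bounds implied by the quoted numbers rather than additional measurements.

```python
def fold_enhancement(k_channeled: float, k_diffusive: float) -> float:
    """Ratio of channeled to diffusive rate constants (same units for both)."""
    if k_channeled < 0 or k_diffusive <= 0:
        raise ValueError("rate constants must be non-negative, diffusive > 0")
    return k_channeled / k_diffusive


# Figures quoted in the text for M5+TE, in min^-1.
K_2B_NAC = 0.02        # elongation rate constant with NAC thioester 2b
MIN_FOLD_4B = 25.0     # "at least 25-fold" increase when 4b is the substrate

# Lower bound on kcat for the acyl-ACP adduct 4b implied by those two numbers.
k_4b_lower_bound = K_2B_NAC * MIN_FOLD_4B

# anti-Diketides: NAC thioesters are below the detection limit, so the limit
# is used as an upper bound on their rate, giving a lower bound on the ratio.
K_ANTI_NAC_DETECTION_LIMIT = 0.01   # upper bound for 2c and 2d
K_ANTI_ACP_APPROX = 1.0             # approximate rate for 4c and 4d
anti_fold_lower_bound = fold_enhancement(K_ANTI_ACP_APPROX,
                                         K_ANTI_NAC_DETECTION_LIMIT)

print(f"kcat for 4b is at least about {k_4b_lower_bound:.2f} min^-1")
print(f"anti-diketide enhancement is at least about {anti_fold_lower_bound:.0f}-fold")
```

As the text cautions, the anti-diketide ratio is an apparent value: if the NAC thioesters were assayed below saturation because of solubility limits, the true difference in maximal rates could be smaller.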

[00155] The Reversibility of ACPn to KSn+1 Transfers. Finally, the ACP-mediated 
strategy for diketide loading onto acceptor modules also enabled us to address the 
question of reversibility of the transacylation reaction between the donor ACP and the 
recipient KS. While co-incubation of 14C-labeled 2a with holo-ACP4(4) afforded
essentially no labeling of the ACP, co-incubation of 14C-labeled 2a with holo-ACP4(4) in
the presence of (5)M2+TE gave both labeled (5)M2+TE and ACP4(4) (Figure 15A). The
proposed mechanism for the observed labeling is shown in Figure 15B and requires back- 
transfer of the acyl group from the KS to the ACP of the upstream module. Back-transfer 
was also observed under turnover conditions, albeit at a substantially reduced level. 
Thus, the observed directionality of chain transfer in the context of a multimodular PKS
that is rapidly turning over appears to arise from kinetic channeling of these
intermediates rather than a ratchet mechanism that explicitly precludes back-transfer.
However, given the 20-fold excess of ACP to module, the occupancy level of the diketide 
on the ACP is quite low. Consequently, in a PKS where two modules on separate 
polypeptides exist in approximately equimolar ratios, reverse transfer from a downstream 
module to an upstream module occurs rarely and only at steps where a significant barrier
for forward chain transfer is encountered. In contrast, for intermodular chain transfer
between modules within the same polypeptide, the effective molarity of the donor ACP
group is significantly higher, and reversible transfer may be more significant (but without
chemical consequence). The likelihood of intrapolypeptide reverse transfer may explain
why the loading didomain shows relatively low selectivity for a propionyl starter unit
versus an acetyl starter unit (2:1), see Lau, et al., Biochemistry 2000, 39, 10514-10520,
whereas DEBS1+TE discriminates strongly (32:1) between the two starter units. See
Pieper, et al., Biochemistry 1996, 35, 2054-2060.

[00156] These studies represent the first direct observation of kinetic channeling of
intermediates in a modular PKS. Several dramatic examples are presented for both
intrapolypeptide transfers and interpolypeptide transfers where the maximal rate constant
(kcat) for elongating a particular ketide substrate by a DEBS module increases 10- to
>100-fold when the substrate is channeled relative to when it is diffusively presented.
Linkers are shown to play an important role in kinetic channeling, although the
contribution of other elements, such as the pantetheine arm or protein-protein interactions
between the donor and recipient modules, cannot be excluded. In addition, our studies
have also reinforced the fact that, while individual modules are tolerant of stereochemical
diversity in diketides, they are at the same time fairly specific catalysts. In addition, their
specificities and recognition features do not necessarily correlate with the structures of
their natural substrates. Finally, we have shown that the transfer step from a donor ACP
to an acceptor KS is a fundamentally reversible reaction. Structural and more detailed
mechanistic studies on these remarkable multifunctional catalysts should be particularly
interesting from the viewpoint of understanding the atomic basis for the phenomena
described here.

[00157] As used herein, the terms "a", "an", and "any" are each intended to include
both the singular and plural forms. 

[00158] Numerous modifications may be made to the foregoing systems without 
departing from the basic teachings thereof. Although the present invention has been 
described in substantial detail with reference to one or more specific embodiments, those 
of skill in the art will recognize that changes may be made to the embodiments 
specifically disclosed in this application, yet these modifications and improvements are 
within the scope and spirit of the invention, as set forth in the specification, drawings, and 
claims. All publications or patent documents cited in this specification are incorporated 
herein by reference as if each such publication or document was specifically and 
individually indicated to be incorporated herein by reference.

[00159] Citation of the above publications or documents is not intended as an
admission that any of the foregoing is pertinent prior art, nor does it constitute any
admission as to the contents or date of these publications or documents.

Claims

1. A method to prepare a desired polyketide synthase from individual modules
which method comprises 

providing successive covalently linked modules that comprise an intrapolypeptide
linker (RAL) and successively non-covalently linked modules that comprise 
interpolypeptide linkers (ERL), so as to facilitate the transfer of a nascent polyketide 
chain from a module of said PKS to a succeeding module of said PKS; and
assembling said modules. 

2. The method of claim 1 wherein said assembling is by incubating the polypeptides 
which comprise said modules in a reaction mixture. 

3. The method of claim 1 wherein said assembling is by expressing constructs which 
encode said modules in a host cell. 

4. The method of claim 1 wherein at least some of the modules in the desired 
polyketide synthase are derived from a library of Type I PKS modules and wherein at 
least one module in said polyketide synthase is heterologous with respect to the 
remaining modules. 

5. The method of claim 1 wherein each RAL has an amino acid sequence selected 
from the group consisting of those set forth in Figure 3 or a variant thereof and the 
N-terminal portions of each ERL has the amino acid sequence set forth in Figure 3 or a 
variant thereof. 

6. A polyketide synthase prepared by the method of claim 4.

7. The polyketide synthase of claim 6 which contains erythromycin modules 1, and
3-6 and tylosin module 2, and wherein said polyketide chain is transferred from ery
module 1 to tyl module 2 and then to ery modules 3-6.



8. The polyketide synthase of claim 6 which contains erythromycin modules 1-5 and
narbomycin module 6, wherein said polyketide chain is passed from ery modules 1-5 to 
nar module 6. 



9. The polyketide synthase of claim 6 which contains modules 1 and 3-6 of 
erythromycin and modules 2-3 of tylosin, spiramycin or niddamycin, wherein said 
polyketide chain is passed from ery module 1 to modules 2-3 of tylosin, spiramycin or 
niddamycin and then to ery modules 3-6. 

10. The polyketide synthase of claim 6 which contains modules 1-3 of tylosin, 
spiramycin or niddamycin and modules 3-6 of erythromycin, and wherein said polyketide 
chain is passed from modules 1-3 of said tylosin, spiramycin or niddamycin to ery 
modules 3-6. 

11. The polyketide synthase of claim 6 which contains a module of tylosin,
spiramycin or niddamycin and modules 1-2 and 3-6 of erythromycin, wherein said 
polyketide chain is passed from ery modules 1-2 to the tylosin, spiramycin or niddamycin 
module and then to ery modules 3-6. 

12. The polyketide synthase of claim 6 which contains modules 1 and 3-6 of 
erythromycin and module 5 of tylosin, spiramycin or niddamycin having the enoyl 
reductase catalytic activity inactivated, wherein said polyketide chain is passed from ery 
module 1 to module 5 of tylosin, spiramycin or niddamycin and then to ery modules 3-6. 

13. The polyketide synthase of claim 6 which contains erythromycin modules 1-4 
and 6 and module 6 of spiramycin or niddamycin, wherein said polyketide chain is
passed from ery modules 1-4 to module 6 of spiramycin or niddamycin and then to ery
module 6. 



14. The polyketide synthase of claim 6 which contains module 1 of FK-506/520 and 
modules 2-14 of rapamycin, wherein said polyketide chain is passed from module 1 of 
FK-506/520 and then to modules 2-14 of rapamycin. 

15. The polyketide synthase of claim 6 which contains module 1 and 11-14 of
rapamycin and modules 2-6 of FK-506/520 wherein said polyketide chain is passed from
module 1 of rapamycin to modules 2-6 of FK-506/520 and then to modules 11-14 of
rapamycin. 



16. The polyketide synthase of claim 6 which contains module 1 of rapamycin, 
modules 2-7 of FK-506/520 and modules 12-14 of rapamycin, wherein said polyketide 
chain is passed from module 1 of rapamycin to modules 2-7 of FK-506/520 and then to 
modules 12-14 of rapamycin. 



17. The polyketide synthase of claim 6 which contains module 1 of rapamycin, 
modules 2-8 of FK-506/520 and modules 13-14 of rapamycin, wherein said polyketide 
chain is passed from module 1 of rapamycin to modules 2-8 of FK-506/520 and then to 
modules 13-14 of rapamycin. 



18. The polyketide synthase of claim 6 which contains modules 1-10 of rapamycin
and modules 7-10 of FK-506/520, wherein said polyketide chain is passed from 
modules 1-10 of rapamycin to modules 7-10 of FK-506/520. 

19. A method to prepare a desired polyketide which method comprises incubating 
required substrates with the polyketide synthase of claim 6.

20. The method of claim 19 wherein the substrates comprise a diketide thioester and
thioesters of the required extender units. 

21. The method of claim 20 wherein the extender units are malonyl, methylmalonyl,
ethylmalonyl or hydroxymalonyl thioesters. 

22. A polyketide or polyketide derivative synthesized by a method comprising the 
method of claim 19.

Abstract of the Disclosure



Linking sequences which modulate cross-talk between modules of Type I 
polyketide synthases have been identified. Thus, arbitrarily chosen modules can be 
mixed and matched by supplying the appropriate linkers to obtain desired polyketide 
synthases and new polyketides. The modules are provided suitable linkers so that the 
polyketide chain is passed from one module to the other in the correct sequence.
Synthetic peptides which mimic linkers can be used to inhibit the synthesis of 
polyketides. Kinetic channeling, both intrapolypeptide and interpolypeptide, of diketide 
intermediates in a Type I polyketide synthase can occur.

Figure 2
Figure 3
Figure 6